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ABSTRACT 

We present a new approach to constrain galaxy physical parameters from the combined inter- 
pretation of stellar and nebular emission in wide ranges of observations. This approach relies 
on the Bayesian analysis of any type of galaxy spectral energy distribution using a compre- 
hensive library of synthetic spectra assembled using state-of-the-art models of star formation 
and chemical enrichment histories, stellar population synthesis, nebular emission and atten- 
uation by dust. We focus on the constraints set by 5-band ugriz photometry and low- and 
medium-resolution spectroscopy at rest wavelengths A=3600-7400 A on a few physical pa- 
rameters of galaxies: the observer-frame absolute r-band stellar mass-to-light ratio, M*/£ r ; 
the fraction of a current galaxy stellar mass formed during the last 2.5 Gyr, /sfh; the specific 
star formation rate, ipg; the gas-phase oxygen abundance, 12 + log(0/H); the total effective 
V^-band absorption optical depth of the dust, fy; and the fraction of this arising from dust in 
the ambient interstellar medium, /i. Since these parameters cannot be known a priori for any 
galaxy sample, we assess the accuracy to which they can be retrieved from observations by 
simulating 'pseudo-observations' using models with known parameters. Assuming that these 
models are good approximations of true galaxies, we find that the combined analysis of stel- 
lar and nebular emission in low -resolution (50 A FWHM) galaxy spectra provides valuable 
constraints on all physical parameters. The typical uncertainties in high-quality spectra are 
about 0.13 dex for M*/£ r , 0.23 for / SFH , 0.24 dex for rp s , 0.28 for 12 + log (O/H), 0.64 for 
fy and 0.16 for [i. The uncertainties in 12 + log (O/H) and fy tighten by about 20 percent for 
galaxies with detectable emission lines and by another 45 percent when the spectral resolution 
is increased to 5 A FWHM. At this spectral resolution, the analysis of the combined stellar 
and nebular emission in the high-quality spectra of 12,660 SDSS star-forming galaxies us- 
ing our approach yields likelihood distributions of M*, 12 + log (O/H), fy and ipo, similar to 
those obtained in previous separate analyses of the stellar and nebular emission at the original 
(twice higher) SDSS spectral resolution. Meanwhile, rest-frame ugriz photometry provides 
competitive constraints on M*/L r . We show that the constraints derived on galaxy physical 
parameters from these different types of observations depend sensitively on signal-to-noise 
ratio. Our approach can be extended to the analysis of any type of observation across the 
wavelength range covered by spectral evolution models. 
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1 INTRODUCTION 

The light emitted by stars and the interstellar medium in galax- 
ies contains important clues about the physical processes that drive 
galaxy evolution. Different strategies have been adopted to sur- 
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vey the spectral energy distributions of large samples of galaxies 
to gather these clues. High-quality photometry from surveys such 
as the Sloan Digital Sky Survey (SDSS, |York et al.|[2000)>, the 
Canada-France-Hawaii Telescope Legacy Survey (CFHTLS, Gora-| 
nova et al-l|2009[> an d the Cosmic Evolution Survey (COSMOS, 



Scoville et al.|2007] > has brought important information about the 



rest-frame optical emission from millions of galaxies out to red- 
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shifts of a few. This is supplemented by medium-resolution spec- 
troscopy of over a million local galaxies from the SDSS and the 
Two-degree Field Galaxy Redshift Survey (2dFGRS, Colle ss et al.| 
|2001[ >, and low- and medium-resolution spectroscopy of tens of 
thousands of galaxies at higher redshift from surveys such as the 
VIMOS VLT Deep Survey (VVDS, |Le Fevre et 1ii~||2"005ll and 
the Deep Extragalactic Evolutionary Probe (DEEP2, [Davis et aL| 



2003 ). The interpretation of the high-quality data gathered by these 
surveys has triggered the development of sophisticated models of 



galaxy spectral evolution (e.g., Bruzual & Chariot 2003, Gonzalez 


Delgado et al.|2005||Maraston|2005||Coelho et al.| 


2007| Walcher 


et al.|2009||Vazdekis et al.|20101|Thomas et al.|201 1 


see also Char- 


lot & Longhetti 2001 ; Kewley & Dopita 20021. These models can 



help us constrain the star formation and chemical enrichment his- 
tories of galaxies through the analysis of different types of photo- 
metric and spectroscopic observations. So far, however, the rela- 
tive merits of different observational approaches to constrain basic 
physical properties of galaxies have not been quantified in detail. 

Recent studies have brought important insight into the ef- 
ficiency of different types of observations to constrain physi- 
cal quantities such as star formation history, metallicity and dust 
content from observations of the rest-frame optical emission of 
galaxies. Multiband photometry has proved valuable to constrain 
galaxy redshifts and basic quantities such as stellar mass-to-light 
ratio (provided a stellar initial mass function; hereafter IMF) and, 
when combined with rest-frame ultraviolet and near-infrared con- 
straints, star formation rate and attenuation by dust (e.g. , |Bell| 
& de Jong|[200T| |Salim et al.||2005| [20071 |Ilbert et al.||2006| 



Schaer er & de Barros||2010| but see |Wuyts et al.||2009[ |Conroy| 
et al.|2010| . Medium-resolution spectroscopy allows separate anal- 
yses of nebular-emission and stellar-absorption features, and hence, 
more accurate constraints on the star formation history, stellar and 
interstellar metallicities, attenuation by dust and nuclear activity 
(e.g., Kauffmann et al. 2003a; Brinchmann et al. 2004; Heckman 
et al. 2004, Tremonti et al. 2004, Cid Fernandes et al. 2005 , Gal- 



lazzi et al.|2005 ; Panter et al. 2007 , Wild et al. 2007 1. At low spec- 



tral resolution, the difficulty of distinguishing between the signa- 
tures of nebular-emission and stellar-absorption features, especially 
around H-Balmer lines, renders analyses more complicated (e.g., 
|Lamareille et al.|2 006). This difficulty could be overcome with the 
ability to interpret simultaneously the stellar and nebular emission 
from galaxies. In this case, the relative merits of multiband pho- 
tometry and low- and medium-resolution spectroscopy to constrain 
galaxy physical parameters would need to be reexamined. 

In this paper, we develop a new approach to interpret the com- 
bined stellar and nebular emission from galaxies in wide ranges of 
multi-wavelength observations. To achieve this, we appeal to a set 
of state-of-the-art models of galaxy star formation and chemical 
enrichment histories, stellar population synthesis, nebular emission 
and attenuation by dust. We use these models to generate a com- 
prehensive library of several million galaxy spectral energy dis- 
tributions. This library can be used to retrieve probability density 
functions of physical parameters from the Bayesian analysis of any 
observed spectral energy distribution. We focus here on the con- 
straints set by 5-band ugriz photometry and low- and medium- 
resolution spectroscopy at rest wavelengths A=3600-7400A on a 
few basic physical parameters of galaxies: the observer-frame ab- 
solute r-band stellar mass-to-light ratio (M,/L r ); the fraction of a 
current galaxy stellar mass formed during the last 2.5 Gyr (/sfh); 
the specific star formation rate (ips)', the gas-phase oxygen abun- 
dance [12 + log (O/H)]; the total effective V-band absorption opti- 
cal depth of the dust Cry); and the fraction of this arising from dust 



in the ambient interstellar medium (hereafter ISM; fi). These pa- 
rameters cannot be known a priori for any galaxy sample. To assess 
the accuracy to which they can be retrieved from different types of 
observations, we therefore simulate 'pseudo-observations' by con- 
volving the spectral energy distributions of models with known pa- 
rameters with appropriate instrument responses and then applying 
artificial noise to mimic true observations. This approach allows 
us to make accurate predictions for the optimistic case in which 
the models we rely on are good approximations of true galaxies. A 
most notable outcome of our study is that the combined analysis of 
stellar and nebular emission in low-resolution (50 A FWHM), high- 
quality (median signal-to-noise ratio per pixel S/N = 20) galaxy 
spectra provides valuable constraints on all the physical parameters 
mentioned above. We also explore how the constraints from differ- 
ent types of photometric and spectroscopic observations depend on 
signal-to-noise ratio and the inclusion of nebular emission in the 
analysis. Our main results are summarized in Table[2](Section[5](. 

We describe our approach to constrain galaxy physical param- 
eters from the combined analysis of stellar and nebular emission 
in Section [2] below. In Section [3] we investigate the accuracy to 
which the parameters M„/L r , /sfh, V's, 12 + log (O/H), ry and 
H can be constrained from different types of photometric and spec- 
troscopic observations. Then, in Section [4] we compare the con- 
straints derived from the analysis of the medium-resolution spec- 
tra of 12,660 SDSS star-forming galaxies using our approach with 
those obtained in previous separate analyses of the stellar and neb- 
ular emission. We also investigate the influence of the prior distri- 
butions of physical parameters on the retrieved probability density 
functions. Our conclusions are summarized in Section|5] Through- 
out this paper, we adopt the standard cosmology Q,m = 0.3, 
Q A = 0.7 and h = 0.7. 



2 MODELING APPROACH 

In this section, we describe our approach to assess the retrievabil- 
ity of galaxy physical parameters from the combined analysis of 
stellar and nebular emission in wide ranges of multi-wavelength 
observations. We first build a comprehensive library of star forma- 
tion and chemical enrichment histories of galaxies (Section \2.l\ . 
We appeal to state-of-the-art models of spectral evolution to com- 
pute the emission from stars and gas and the attenuation by dust 
in these galaxies (Section [2. 2\ . This allows us to assemble a large 
library of galaxy spectral energy distributions (Section [2.3fr , which 
we can use to retrieve likelihood distributions of physical param- 
eters from the Bayesian analysis of any type of observed spectral 
energy distribution (Section[2.4|(. 



2.1 Library of star formation and chemical enrichment 
histories 

We build a comprehensive library of galaxy star formation and 
chemical enrichment histories by performing a semi-analytic post- 
treatment of the Millennium cosmological simulation of Springel 
|et al.| |2005). This large-scale cosmological simulation follows the 
growth, interaction and merging history of dark matter haloes from 
redshift z = 127 to the present timerjWe adopt the simple semi- 
analytic recipes of |De Lucia & Blaizot| {2007 see also |Croton| 



1 In the Millennium simulation, dark matter is represented as a set of about 
|10 lu point particles, which interactg ravitationall y in a cubic region of side 
|500/i -1 Mpc. Individual particles have a mass of 8.6 X 10 a fe- 1 M B , and] 
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Figure 1. Example of galaxy star formation and chemical enrichment histo- 
ries inferred from the semi-analytic post-treatment of the Millennium cos- 
mological simulation: (a) star formation rate, i/>, and (b) interstellar metal- 
licity, Z, plotted as a function of look-back time, for a galaxy with a present- 
day stellar mass of 1.5 X 1O 1O M . The vertical dashed line marks the evo- 
lutionary stage at which the galaxy is looked at in Fig. [^(corresponding to 
a galaxy age of 8.4 Gyr). 



|et al.|20 06 ) to follow star formation and the associated metal pro- 
duction by gas falling into these dark matter haloes. We com- 
pute in this way the star formation and chemical enrichment his- 
tories of 500,000 galaxies with present-day stellar masses com- 
prised between M* ~ 5 x lO 9 /^ 1 Mq (corresponding roughly 
to the mass resolution of the simulation; Blaizot et al. 2004} and 
5 x 10 11 fa -1 Mq. Figure[T|shows an example of star formation and 
chemical enrichment histories of a galaxy with present-day stellar 
mass 1.5 x 1O 1O M0. The spikes in both curves reflect the various 
episodes of interactions and mergers that shaped the evolution of 
the galaxy. As shown by | Wild et al.| |2008), star formation histo- 
ries computed in this way account well for the observed properties 
of galaxies at different redshifts in the VVDS (0.5 < z < 1) and 
SDSS (0.05 < z < 0.1) samples. 

About 35 percent of the 500,000 galaxies generated from the 
semi-analytic post-treatment of the Millennium simulation do not 
form stars at z = 0. These contain a negligible amount of interstel- 
lar gas and correspond to the population of present-day early-type 
galaxies. The remaining galaxies span a relatively narrow range of 
star formation properties. Their specific star formation rates peak 
around ips ~ 0.1 Gyr -1 , with a dispersion of only a factor of 2. 
Furthermore, the interstellar metallicity Z in these galaxies corre- 
lates significantly with ?ps, in the sense that the most metal-poor 
galaxies form stars most actively. The galaxies also have roughly 
similar recent histories of star formation, as quantified by the star 
formation history parameter /sfh, which we define as the fraction 
of the current galaxy stellar mass formed during the last 2.5 Gyr 
(this time lag corresponds to the main-sequence lifetime of A-F 



the detection threshold for a dark matter halo is 20 particles, i.e. about 1.7 X 
10 w h~ 1 MQ (seelSpringel et al.|2005| for detail). 



stars with strong B aimer absorption features). This parameter has 
a narrow distribution centred on /sfh ~ 0.15, with a dispersion 
of only 0.1. The distributions of ips and /sfh are compatible with 
roughly constant (overall) star formation over the past 2.5 Gyr in 
galaxies more massive than 5 x 1O 9 /i _1 M0, given that 40 per- 
cent of the stellar mass formed during this period is returned to the 
ISM in the form of winds and supernovae (e.g. Bruz uaT"& Chariot] 
2003). The above distributions of i/js, Z and /sfh are consistent 
with the observed properties of nearby, star-forming SDSS galax- 
ies (e.g. | Kauffmann et al.|20 03a, Brinchmann et al. 20d4[ |Tremonti| 
|et al.|2004) . ~ 

The relatively limited ranges of star formation histories and 
interstellar metallicities described above, while appropriate to in- 
terpret observations of nearby SDSS galaxies more massive than 
5 x 1O 9 /i _1 M0, could bias statistical estimates of ^>g, Z and /sfh 
in galaxies drawn from other samples or in different mass ranges 
(e.g. |Weinmann et al.|2006{ see also Section |4~2] below) . To avoid 
such biases, we broaden the original prior distributions of these 
parameters predicted by the semi-analytic recipes of |De Lucia &| 
|Blaizot|p007l > in 2 ways. Firstly, we widen the range in /sfh by 
randomly drawing the evolutionary stage (i.e. galaxy age) at which 
a galaxy is looked at in the library. In practice, we draw this stage 
uniformly in redshift between z — and 1.7, i.e. the upper limit 
beyond which /sfh reaches unity for a substantial fraction of the 
galaxies (Fig. |4jj below; this resampling makes the proportion of 
early-type galaxies drop from 35 to about 10 percent in the library). 
Secondly, for galaxies with ongoing star formation at the selected 
stage, we broaden the distributions of the current star formation 
rate and the current interstellar metallicity, as follows (we define 
here as 'current' the average of a quantity over a period of lOMyr 
before a galaxy is looked at). We redistribute the specific star for- 
mation rate over the interval —2 < log^s/Gyr" 1 ) < 1 using 



the probability density function p[log(^s / Gyr 



0.05[1.5 5 



|log(i/>s/Gyr 1 ) + 0.5 1 ], which is approximately uniform over 
the logarithmic range from ips = 0.03 to 3 Gyr" 1 and drops to 
zero at ^s = 0.01 and 10 Gyr" 1 (Fig.^). We redistribute the cur- 
rent interstellar metallicity Z logarithmically between 0.1 and 3.5 
times solar (Z e = 0.017). 

The random resampling of ips and Z for star-forming galaxies 
in the library is justified in part by the stochastic nature of star for- 
mation and chemical evolution in hierarchical scenarios of galaxy 
formation. Also, since the changes impact only the last lOMyr of 
evolution, they have a negligible influence on the global star for- 
mation and chemical enrichment histories derived from the semi- 
analytic post-treatment of the Millennium simulation. In return, the 
use of wider and uncorrelated prior distributions of ips and Z, to- 
gether with the broader sampling in galaxy age (and /sfh), greatly 
enhances the usefulness of the model library to explore and inter- 
pret the spectral properties of observed galaxy populations. 



2.2 Galaxy spectral modeling 

We now describe the way in which we compute the emission from 
stars and gas and the attenuation by dust in the model galaxies gen- 
erated in the previous section. The luminosity per unit wavelength 
L\ (t) emerging at time t from a galaxy can be expressed as (see, 
e.g., |Charlot & Longhetti|200T] > 



dt' ip(t - t') Sx [*', Z(t - t')] T x (t, t') . 



(2.1) 
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where ij)(t—t') is the star formation rate at time t — t', S\[t' , Z(t — 
t')] is the luminosity emitted per unit wavelength and per unit mass 
by a stellar generation of age t' and metallicity Zit — t'), and 
T\(t,t') is the transmission function of the interstellar medium, 
defined as the fraction of the radiation produced at wavelength A 
at time t by a generation of stars of age t' that is transferred by 

~^1 |2001| >, we write T x (t, t') 



the ISM. Following Chariot & Longhetti 



as the product of the transmission functions of the ionized gas, 
T+(t,t'), and the neutral ISM, Tf(t,t'), 

T x (t,t')=T+(t,t')TUt,t'). (2.2) 

In the next paragraphs, we describe the prescriptions we adopt for 
the functions S x , T+ and TP. 



2.2.1 Stellar emission 

We compute the luminosity S\(t, Z) emitted per unit wavelength 
and per unit mass by a stellar generation of age t and metallicity Z 
using the latest version of Bruzual & Chariot ( 2003 1 stellar popula- 
tion synthesis code (Chariot & Bruzual, in preparation). This code 
predicts the spectral evolution of stellar populations at wavelengths 
from 91 A to 160 and at ages between 1 x 10 5 and 2 x 10 10 yr, 
for different metallicities (from about 0.005^0 to 4Zq), initial 
mass functions and star formation histories. We use the most re- 
cent version of the code, which incorporates a new library of 
observed stellar spectra (Sanchez-Blazquez et al. 2006) and new 
prescriptions for the evolution of stars less massive than 20A/ Q 
(Bertelli et al. 2008, 2009) and for the thermally pulsating asymp- 
totic giant branch (TP-AGB) evolution of low- and intermediate- 
mass stars (Marig o et al.|2008) . The main effect of the revised TP- 
AGB prescription is to improve the predicted near-infrared colours 
of intermediate-age stellar populations (e.g., Gonzalez-Lopezlira 
et al. 2010). In all applications throughout this paper, we adopt the 
Galactic-disc IMF of |Chabrier|p003) . 



limit and greater than unity at wavelengths corresponding to emis- 
sion lines (we record the luminosities of the 109 most prominent 
emission lines at all wavelengths; we also include the recombi- 
nation continuum radiation). When computing T^(t,t') in equa- 
tions j2.1[ )-l |2"72"} , we take the metallicity of the photoionized gas to 
be the current metallicity Z(t) of the galaxy. We randomly draw 
the dust-to-metal ratio from a uniform prior distribution between 
£d = 0.1 and 0.5, and the zero-age ionization parameter from a 
logarithmic prior distribution between log Uo — —3.5 and —1.5. 
We fix all other parameters of the photoionization code at the stan- 
dard values favored by Chariot & Longhetti (2001 , see their paper 
for detail). 

In Fig. [2] we compare the luminosities of prominent optical 
emission lines obtained in this way for galaxies with different Z, 
Uo and £d, with high-quality observations of a sample of 28,075 
star-forming galaxies from the SDSS Data Release 7 (DR7; [Abaza-| 
[jian et al.|20 09). The different panels show the relations between 
different luminosity ratios constructed with the [O II] AA3726, 3729 
(hereafter [On]A3727), Hj3, [Om]A5007, Ha, [Nn]A6584 and 
[S II]AA6716, 6731 lines. We selected the star-forming galaxies of 
Fig. [2] from the SDSS DR7 by requiring high signal-to-noise ratio 
(S/N > 10) in all lines and excluding galaxies in which the emis- 
sion could be contaminated by an active galactic nucleus (AGN; 
we applied the conservative criterion of Kauffmann et al. 2003b to 
reject AGNs from the standard Baldw in et al.|198l| line-diagnostic 
diagram reproduced in Fig. |2^). The excellent agreement between 
models and SDSS observations in Fig. [2] is similar to that ob- 
tained by [ Brinchman n et al.| (2004| ) using the original [Chariot &| 
|Longhetti( i 2001) models. This allows the derivation of independent 
constraints on Z, Uo and £d from the different line luminosity ra- 
tios. For simplicity, in what follows, we adopt a fixed intrinsic line 
velocity dispersion of 100 km s -1 (typical of SDSS star-forming 
galaxies; e.g. |Brinchmann et al.|20 04) when computing the contri- 
bution by ionized gas to the emission from galaxies. 



2.2.2 Nebular emission 



A main feature of our study is that we account for the contri- 
bution by ionized gas to the emission from galaxies. We follow 
the prescription of Chariot & Longhetti (2001) to compute the 
transmission function T^(t,t') of the gas photoionized at time t 
by stars of age t' in a galaxy. This consists in adopting effective 
(i.e. galaxy-wide) parameters to describe the ensemble of H II re- 
gions and the diffuse gas ionized by young stars throughout the 
galaxy. The main adjustable parameters are the interstellar metallic- 
ity, Z, the zero-age ionization parameter, Uo (which characterizes 
the volume-averaged ratio of ionizing-photon to gas densities at age 
t' = 0), and the dust-to-metal (mass) ratio, £d (which characterizes 
the depletion of metals on to dust grains) of the photoionized gas[^] 
Following Chariot & Longhetti (2001 1, we neglect the contri- 
bution by stars older than lOMyr to nebular emission, i.e., we write 



Tt{t,t') 



for i'>10Myr. 



(2.3) 



We use the standard photoionization code CLOUDY ( |Ferland|1996) 
to compute the transmission function at earlier ages, assuming 
that galaxies are ionization bounded. For t' < 10 7 yr, therefore, 
T^(t, t') is close to zero at wavelengths blueward of the H-Lyman 



2 These effective parameters were denoted by Z, {Uo) and §d m 
|& Longhetti|(200T) . 



Chariot 



2.2.3 Attenuation by dust 

The attenuation by dust of the emission from a galaxy depends on 
several factors, including the amount and structure of the interstel- 
lar gas, the metallicity and physical conditions of this gas and the 
orientation of the galaxy. Two galaxies with identical star formation 
and chemical enrichment histories but different sizes (and hence 
different total gas masses), ISM structures and orientations can 
have widely different dust attenuation optical depths, even though 
their dust-free spectral energy distributions are the same (modulo a 
mass scaling factor). We assume here for simplicity that the dust at- 
tenuation optical depth of a galaxy does not depend on other global 
galaxy parameters, such as the age t, the specific star formation rate 
ips an d the gas metallicity Z. We therefore write the transmission 
function of the neutral ISM in equations l |2.1[ >- j2"72") as simply a 
function of stellar age t' , 



T A °(t,0=7\°(t') 



(2.4) 



In the ISM, stars are born in giant molecular clouds, which 
dissipate on a time-scale of the order of lOMyr (e.g., Blit z & Shu| 
[l980l|Kawamura et aL|2009) >. Thus the emission from stars younger 
than this time-scale and from the photoionized gas is more atten- 
uated than that from older stars. We account for this effect using 
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where ff c and f A SM are the effective absorption optical depths of 
the stellar birth clouds and the ambient (i.e. diffuse) ISM, respec- 
tively. We note that, since the intrinsic spectral energy distributions 
of young and old stars evolve in time, equation {1.6\ implies that 
the global attenuation curve of a galaxy containing several genera- 
tions of stars is complicated function of galaxy age, even though f\ 
is a simple function of t' (see, e.g., fig. 5 of |Charlot & Fall|2 000l. 

We must also specify the dependence of f " c and f A BM on 
wavelength. For stellar birth clouds, we adopt a simple power-law 
of index —1.3 appropriate for optically thick clouds with dust prop- 
erties in the middle range of the Milky Way, the Large and the Small 
Magellanic Clouds jda Cunha et al.| |2008). In the ambient ISM, 
the shape of the effective absorption curve is more uncertain, as 
the spatial distribution of dust relative to the stars can vary widely 
from galaxy to galaxy. Also, while f A SM represents an average op- 
tical depth along rays emanating in all directions from all stars, in 
practice, the attenuation will depend to some degree on the angle 
under which a galaxy is observed. Thus we write 



- BC 



u)fy (A/5500, 



]og(L[01I]/L[0III]) 



r A ISM = fiT V (A/5500, 



(2.7) 



(2.8) 



Figure 2. Emission-line luminosities of galaxies computed using the 
models described in Section |2.2.2| (lines), compared with high-quality 
observations of a sample of 28,075 star-forming galaxies from the 
SDSS DR7 (black dots). The models assume constant star formation 
over the past lOMyr. The data are corrected for attenuation by dust as 
described in |Brinchmann et al.| pOOl) . (a) L([0 III]A5007)/L(H/3) 
versus L([NlI]A6584)/L(Ha). (b) L([Olll]A5007)/£(H/3) versus 
L([Sll]AA6716,6731)/L(Ha). (c) L([Om]A5007)/L(H/3) versus 
L([Nn]A6584)/L([On]A3727). (d) L([Om]A5007)/L(H/3) versus 
L([0 II] A3727) /L([0 III] A5007). In each panel, lines of different colours 
refer to models with different zero-age ionization parameter log Uo (cyan: 
— 1.5; red: —2.0; green: —2.5; magenta: —3.0; blue: —3.5). At fixed 
log Uo, the lower dashed line corresponds to models with dust-to-metal ra- 
tio £d = 0.1, the solid line to models with = 0.3 and the upper dashed 
line to models with £,j = 0.5. Along each line, dots mark the positions of 
models with different gas metallicity Z (0.10, 0.20, 0.50, 0.75, 1.00, 1.50, 
2.00 and 3.00 times Zq, from left to right; only low-metallicity models 
are considered for log Uo = —1.5). In (a), the gray long- short-dashed 
line shows the Kauffmann et al. 1 2003b I criterion to separate star-forming 
galaxies from AGNs. 



the simpl 
transmission 



tple dust model of |Charlot feTali] (2 000), who express the 
ission function of the neutral gas a^_] 

T A °(t')=exp[-f A (0] • 



(2.5) 



Here t\(t') is the 'effective absorption' (or attenuation) optical 
depth of the dust seen by stars of age t' . This represents the av- 
erage absorption of photons emitted in all directions by stars in all 
locations within the galaxy, including the effects of scattering on 
the path lengths of the photons before they are absorbed. Follow- 
ing |ChMlotXFaIi]j2000}, we write 



*X(0 



-BC , -ISM 
T \ + T \ 
-ISM 



for 
for 



i! < 10 Myr , 
t' > 10 Myr , 



(2.6) 



3 The function denoted here by T? (t ' ) is equivalent to the function T\(t') 
defined by equation (6) of |Charlot &FaU] (2000) , for / = 0.0 in their 



In these expressions, fy = fy C +Ty 3M is the total effective V-band 
absorption optical depth of the dust seen by young stars inside birth 
clouds, and fi = fy M /(fv C + fy M ) is the fraction of this con- 
tributed by dust in the ambient ISM. A value of /i around 0.3 and a 
typical slope n = 0.7 in equation \2.fy account remarkably well for 
several observed mean relations between the ultraviolet, optical and 
infrared properties of nearby g alaxies ^Chariot & Fall 2000| |Kong| 



|eTaI1[20tj4l |Salim et al.||20T77l |da Cunha et al.||2008[ Wild et al 
2011a). More refined investigations using sophisticated dust mod- 
els suggest that, in detail, the attenuation curve should flatten when 
the optical depth increases JPierini et al.|2004| [Tuffs et al.|2004| 
|Rocha et al. 2008jl. This is supported by the typically greyer optical 
attenuation curves observed in nearby edge-one galaxies compared 
to face-on ones (Wild et al. 2011b). A recent analysis shows that 
the trend favored by current models and observations can be rea- 
sonably well approximated by a simple expression of the slope of 
the attenuation curve of the form 



2 

1 + fiTV ' 



(2.9) 



with a scatter of about 20 percent at fixed (xfv (Chevallard et al., in 
preparation). 

To compute T°(t,t') in equations |2.l[ >- {2~2"} , we therefore 
proceed as follows. We first draw fy between 0.01 and 4 from 
the probability density function p(ry) = 0.18 arctan[5(4 — tv)], 
which is approximately uniform over the range from fv = 0.01 
to 3 and drops to zero at fy =4 (see Fig. [4^ below). We fur- 
ther draw fj, from a uniform prior distribution between 0.1 and 0.7, 
and we draw n uniformly over an interval An = 0.3 centred on 



the mean value predicted by equation (2.9i. Then, we report the 
selected fv, p and n in equations l |2.4} - 2.8 1. This prescription ac- 
counts for a much broader range of plausible dust attenuation laws 
than the simple recipes usually adopted in galaxy spectral analyses 
(e.g., Kauffmann et al. 2003a; Bri nchmann et al.|2004||Cid Fernan-| 
|des et al.|2005[|Panter et al.|2007||Wuyts et al.|2009) . We therefore 
expect larger typical uncertainties arising from attenuation by dust 
than in these previous studies. 
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2.3 Library of galaxy spectral energy distributions 



We use the library of star formation and chemical enrichment his- 
tories described in Section |2~T| and the spectral models described 
in Section |2~2| to simulate a large library of galaxy spectral energy 
distributions. This will be used in the remainder of this paper to 
assess the retrievability of galaxy physical parameters from various 
types of observations. To ensure that all kinds of spectral energy 
distributions are properly sampled, for each of the 500,000 star for- 
mation and chemical enrichment histories in the original library, we 
draw 10 different realizations of the evolutionary stage (and hence 
of the parameter /sfh) and, for star-forming galaxies, of the cur- 
rent physical properties (parameters f/>s, Z, log Uq, £d, fv and m)> 
as summarized in Table [T] The final library therefore contains 5 
million models, which we can use to generate catalogs of galaxy 
spectra. 

In the present study, we focus primarily on the rest-frame op- 
tical properties of galaxies. Our main goal is to compare the con- 
straints that can be derived on galaxy physical parameters from 
different types of observations in this wavelength range. At wave- 
lengths between 3525 and 7500 A, the stellar population spectra 
computed using the models presented in Section [2.2. 1 1 have a na- 
tive spectral resolution of 2.3 A (full width at half-maximum, here- 
after FWHM), corresponding to a resolving power R ~ 2200 at 
5000 A (the resolution is coarser outside this wavelength range). 
This allows us to investigate the retrievability of galaxy physical 
parameters from 4 different types of observations at optical wave- 
lengths: 

• Medium-resolution spectroscopy ( \=3600-7400A). We adopt 
a fiducial resolution of 5 A FWHM, corresponding to R = 1000 at 
A = 5000 A, and a fixed pixel size of 2.5 A. 

• Low-resolution spectroscopy (\= 3600-7400 A). We adopt a 
fiducial resolution of 50 A FWHM, corresponding to R = 100 at 
A = 5000 A, and a fixed pixel size of 25 A. 

• Equivalent-width measurements of strong emission lines. 
We also investigate the constraints set purely by the equivalent 
widths of the following emission lines, at both low and medium 
spectral resolution: [On]A3727; H/3; [Om]A4959, [Oiii]A5007; 
[Nn]A6548; Ha; [Nn]A6584; [Sn]A6716 and [Sn]A6731. An 
advantage of equivalent widths is that they can provide useful in- 
formation even in spectra which are not flux-calibrated. 

• Broadband photometry. We adopt the SDSS ugriz filter re- 
sponse functions. 

As an example, Fig.[5]shows the spectral energy distribution of the 
same model galaxy as in Fig. [T|at the age of 8.4 Gyr (with current 
physical parameters resampled using the distributions in Table [TJ, 
computed alternatively at medium (panel a) and low (panel b) spec- 
tral resolution and convolved with the SDSS ugriz filter response 
functions (panel c). 



2.4 Retrievability of galaxy physical parameters 

The galaxy spectral library generated in the previous section can be 
used to assess the retrievability of physical parameters from differ- 
ent types of observations. To achieve this, we first draw a random 
subsample of 10,000 spectral energy distributions from the library, 
to which we apply artificial noise to mimic true observations. For 
low- and medium-resolution spectroscopy, we apply noise indepen- 
dent of wavelength (i.e. white noise), parametrized in terms of the 
median S/N per pixel, S/N. We consider the cases S/N = 5 and 
20 (for reference, the stellar population spectra computed using the 
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Figure 3. Spectral energy distribution of the same model galaxy as in Fig.fT 
at t he ag e of 8.4 Gyr, computed using the models described in SectionsITT 
and|z2] (a) At a spectral resolution of 5 A FWHM (R = 1000 at 5000 A). 
The black portion of the spectrum (A=3600-7400 A) is that used to retrieve 
galaxy physical parameters at this resolution, (b) Same as in (a), but at a 
spectral resolution of 50 A FWHM (R = 100 at 5000 A), (c) Same as in 
(a), but convolved with the SDSS ugriz filter response functions (shown 
at the bottom). The model galaxy has /sfh = 0-3 and current parameters 
(resampled using the distributions in Table ^ i/ig = 0.08 Gyr -1 , Z = 
OAZq, log U = -2.8, £ d = 0.3, t v = 1.0, u = 0.6 and n = 0.7. In 
(a) and (h), the adopted pixel size is half the size of a resolution element. 



models presented in Section 2.2.1 have a native S/N of about 95). 
For multiband photometry, instead, we take the signal-to-noise ratio 
to be the same in all wavebands (S/N = 10 or 30). In the following, 
we refer to this subsample of 10,000 noisy spectral energy distri- 
butions as 'pseudo-observations', and to the 4,990,000 remaining 
spectral energy distributions in the library as simply 'models'. 

We use a Bayesian approach to quantify the accuracy to which 
several physical parameters of these pseudo-galaxies can be re- 
trieved from different types of observables. The likelihood of the 
jth model, noted A4j, given the spectral energy distribution of a 
pseudo-galaxy can be written 



\nP{M j \{O i })=-\Y J 



O, - wjMj,^ 2 



(2.10) 



where Oi and Mj,i are the spectral energy distributions (i.e., either 
the spectral flux density at the i th wavelength, or the equivalent 
width of the i th emission line, or the flux in the i th photometric 
band; see Section [2~3) l of the pseudo-galaxy and the j th model, re- 
spectively, Oi is the uncertainty in Oi corresponding to the adopted 
signal-to-noise ratio, and 



y-y OiMj,i 



(2.11) 



is the model scaling factor that maximizes P(A4j j{0; }) (this pa- 
rameter is unity in the case of scale-free equivalent widths). In prac- 
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Table 1. Prior distributions of the current (i.e. averaged over the last lOMyr) physical properties of star-forming galaxies in the library of star formation and 
chemical enrichment histories assembled in Sections l2. 1 I and l2~2l 



Parameter 


Description 


Range 




Probability Density Function 




Specific star formation rate 


-2 < log^s/Gyr- 1 ) < 1 


0.05 


1.5 s - log^s/Gyr- 1 ) + 0.5 5 




z 


Metallicity of the star-forming gas 


-1 < log(Z/Z Q ) < 0.54 




uniform 




log Uo 


Zero-age ionization parameter 


-3.5 < log U < -1.5 




uniform 




Dust-to-metal ratio in the ionized gas 


0.1 < £ d < 0.5 




uniform 


TV 


Total V-band attenuation optical depth of the dust 


0.01 < iv < 4 




0.18arctan [5(4 - Ty)] 




Fraction of fy contributed by dust in diffuse ISM 


0.1 < a* < 0.7 




uniform 


n 


Slope of the attenuation curve in the diffuse ISM 


1.6 < n(l + HTv) < 2.4 




uniform 



tice, in the case of medium-resolution spectroscopy, the summation 
on the right-hand side of expression ( |2. 10[ > extends over 1520 wave- 
length points and hence generally yields values greater than 2000, 
even for a good-fit model^The absolute likelihood in this case 
cannot be evaluated numerically, as it is always rounded to 0. For 
medium-resolution spectroscopy, therefore, we rather compute the 
probability of the j th model relative to the best-fitting model, noted 
Mo, 

]nP(Mj\{Oi}) = \nP(Mj\{Oi}) - ]nP(M \{Oi}) . (2.12) 

We have checked that in the case of low-resolution spectroscopy, 
where both P(Mj\{Oi}) and P{Mj\{Oi}) can be evaluated nu- 
merically, the likelihood distributions of galaxy physical parame- 
ters obtained using equations ( |2.10} and l |2.12^ are similar. 

In this paper, we focus on the retrievability of 6 physical pa- 
rameters from the observed spectral energy distributions of galax- 
ies: 

• the observer-frame absolute r-band (stellar) mass-to-light ra- 
tio, M,/L r . This is the ratio of the galaxy stellar mass to the abso- 
lute luminosity in the observed r band; 

• the fraction of the current galaxy stellar mass formed during 
the last 2.5 Gyr, /sph; 

• the specific star formation rate, tps ; 

• the gas-phase oxygen abundance in units of 12 + log(0/H), 
where O/H is the abundance by number of oxygen relative to hy- 
drogen. In the models of nebular emission adopted in Section [2.2.2[ 
the gas-phase oxygen abundance corresponding to solar metallicity 
(Z = Z Q ) is 8.81, 8.73 and 8.63 for dust-to-metal ratios £d = 0.1, 
0.3 and 0.5, respectively. 

• the total effective V-band absorption optical depth of the dust, 
fy (Section [2,2,3| l; 

• the fraction of fy arising from dust in the ambient ISM, jj,. 

Fig. [4] shows the prior distributions of these parameters in 
the galaxy spectral library generated in Section [23] In each panel, 
the shaded histogram shows the distribution including all galaxies, 
while the solid histogram shows the contribution by star-forming 

4 We note that only a small contribution to these high values arises from our 
neglect of intrinsic correlations between spectral pixels in equation j2.10) . 
A detailed investigation reveals that such correlations are significant only 
for pixels associated with emission lines, which reflects the fact that all 
emission lines arise from the same ionizing radiation. We have used a test 
sample of 3000 pseudo-galaxies to check the effect of accounting for pixel- 
to-pixel correlations on the calculation of model likelihoods. This requires 
the introduction of the covariance matrix in equation (2.10) and is compu- 
tationally very demanding. The result is a negligible effect on the computed 
model likelihoods. Hence equation (2.10[ should be appropriate to compute 
model likelihoods. 



galaxies alone. Galaxies without star formation have typically high 
M„/L r in Fig. |4ji and /sfh ~ in Fig. [4jj. These galaxies are 
off scale (at logo's = — oo) in Fig. |4j;. They do not contribute to 
the histograms in Figs |4jl-f, which pertain to interstellar param- 
eters. As emphasized in Sections |2.1| and |2.2| the prior distribu- 
tions of Fig. [4] are as flat as possible to avoid unwanted biases in 
the retrieval of physical parameters from observed spectral energy 
distributions of galaxies. This choice is motivated by the desire to 
make the spectral library appropriate for the analysis of galaxies 
in wide ranges of physical properties (we note that the finite ex- 
tent of the prior distributions in Fig. [4] could still bias analyses of 
galaxies with extreme parameters). Alternative prior distributions 
can be designed to help reduce the uncertainties in the parame- 
ters derived for galaxies with known specific physical properties 
(e.g., actively star-forming, early-type, metal-poor). We return to 
the influence of prior distributions on parameter determinations in 
Section [4~2| below, where we investigate the results obtained using 
the library of galaxy star formation and chemical enrichment histo- 
ries derived from the original semi-analytic recipes of |De Lucia &| 
|Blaizot| ( [2007l > . 

In Fig. [5] we show examples of the probability density func- 
tions (likelihood distributions) of physical parameters retrieved, us- 
ing equations jO0) - j2TT) and the prior distributions of Fig. [4] 
from the low-resolution, high-S/N spectrum of a pseudo-galaxy in 
our library. This pseudo-observation was obtained by adding arti- 
ficial noise (S/N = 20) to the model spectrum of Fig. [3]}. We take 
the 'best estimate' of each parameter in Fig.[5]to be the median of 
the retrieved probability density function and the associated 'un- 
certainty' to be half the 16th-84th percentile range (this would be 
equivalent to the ±l<r range in the case of a Gaussian distribution). 
To quantify potential biases in the retrieved likelihood distribution, 
we further define the 'accuracy' as the absolute difference between 
the retrieved best estimate and true parameter value. In this exam- 
ple, all the parameters are well retrieved. The success in recover- 
ing the fraction /sfh of intermediate-age stars (Fig. |5j:) and the 
interstellar parameters 12 + log (O/H), fy and ^1 (Figs [5£-[5]*) is 
remarkable. It is the first illustration of the power of our approach 
to exploit strong absorption- and emission-line signatures in noisy, 
low-resolution spectra. We examine this property in detail in Sec- 
tion l3.2l below. 

It is important to stress that the approach adopted here to as- 
sess the retrievability of galaxy physical parameters from differ- 
ent types of observations, while idealized, is as realistic as can 
be achieved using modern techniques. Pseudo-observations are re- 
quired to test the retrievability of physical parameters from observ- 
ables, because the physical parameters of observed galaxies cannot 
be known a priori. The fact that we draw pseudo-observations from 
the same spectral library as the models used to analyze them makes 
the constraints derived on the retrievability of physical parameters 
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Figure 4. Prior distributions of selected physical parameters of the 5 mil- 
lion galaxies in the spectral library generated in Section [23] (a) observer- 
frame absolute r-band stellar mass-to-light ratio, M*/L r ; (b) fraction of 
current stellar mass formed during the last 2.5 Gyr, /sfh; ( c ) specific star 
formation rate, -0s; M gas-phase oxygen abundance, 12 + log(0/H); (e) 
total effective V-band absorption optical depth of the dust, fy ; (/) fraction 
of ry arising from dust in the ambient ISM, fi. In each panel, the shaded 
histogram shows the distribution for all galaxies, while the solid histogram 
shows the contribution by star-forming galaxies alone. Non-star-forming 
galaxies are off scale (at log i/>s = — oo) in panel (c) and do not contribute 
to the distributions of interstellar parameters in panels (d)-(f). 
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Figure 5. Probability density functions of the same physical parameters as 
in Fig. [4] retrieved, using the Bayesian approach described in Section [2~4] 
from the pseudo-galaxy spectrum shown at the top. The spectrum was ob- 
tained by adding artificial noise with S/N = 20 to the R = 100 model 
spectrum of Fig. [5j). In each panel, the black triangle indicates the true value 
of the parameter of the pseudo-galaxy, the solid line the best estimate (50th 
percentile of the retrieved likelihood distribution) and the dashed lines the 
associated 68-percent confidence interval (16th and 84th percentiles). 



optimistic: our results correspond to the best possible case in which 
the models are a realistic representation of true galaxies. In Sec- 
tion [4] below, we also illustrate the constraints that can be derived 
by applying our approach to interpret a sample of observed galaxy 
spectra. 



3 CONSTRAINTS ON GALAXY PHYSICAL 

PARAMETERS FROM DIFFERENT TYPES OF 
OBSERVATIONS 

In this section, we use the modeling approach described in the pre- 
vious section to quantify the accuracy to which physical parame- 
ters of galaxies can be retrieved from different types of observa- 
tions. We focus on the retrievability of the 6 parameters introduced 
in Section l2l4l the observer-frame absolute r-band stellar mass-to- 
light ratio, Al*/L r ; the fraction of the current galaxy stellar mass 
formed during the last 2.5 Gyr, /sfh; the specific star formation 
rate, tps \ the gas-phase oxygen abundance, 12 + log (O/H); the to- 
tal effective V^band absorption optical depth of the dust, fy; and 
the fraction of fy arising from dust in the ambient ISM, /1. In Sec- 
tion |3.1| below, we first examine the constraints set on these param- 
eters by multi-band photometric observations. We consider the in- 
fluence of several factors, such as the signal-to-noise ratio, the num- 
ber of photometric bands, the contamination of broad-band fluxes 
by nebular emission lines and the uncertainty introduced by the 
lack of redshift information. We find that photometric observations 
can provide useful constraints mainly on M,/L r and ips- Then, in 



Section |Jl2| we quantify the improvement that can be achieved in 
the constraints on all parameters by appealing to different types of 
spectroscopic observations. 

3.1 Constraints from multi-band photometry 

3.1.1 Nearby galaxies 

We first explore the extent to which physical parameters of nearby 
galaxies can be constrained using 5-band ugriz photometry. We 
extract 10,000 spectral energy distributions from the library assem- 
bled in Section [23] to compute pseudo-observations of galaxies in 
these bands, as described in Section |2.4| We adopt for the mo- 
ment a signal-to-noise ratio S/N = 30 in all bands and place all 
spectral energy distributions at z = 0. The rest of the 5 million 
models in the library allow us to retrieve likelihood distributions 
of M,/L r , /sfh, ips, 12 + log (O/H), fy and /1 for each of these 
10,000 pseudo-galaxies, by means of equations l |2.10[ l- p.l l\ . In 
the following, we refer to this setting as the 'standard' case. 

Fig.[6]summarizes the constraints that can be set on the physi- 
cal parameters of nearby galaxies using high-quality, 5-band ugriz 
photometry. For each parameter, we plot the average probability 
density function retrieved from the above analysis as a function of 
true parameter value. This average was obtained by coadding and 
then renormalizing the probability density functions retrieved in 
narrow bins of true parameter value for the 10,000 pseudo-galaxies 
in our sample. The solid line in each panel of Fig.|6]locates the me- 
dian (i.e. best estimate; Section [2~4] l of the average likelihood dis- 
tribution in each bin, the dashed lines the 16th and 84th percentiles 
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Figure 6. Average probability density functions of the same 6 physical parameters as in Fig.[4]retrieved, using the Bayesian approach described in Section |2~4] 
from 5-band ugriz photometry with S/N = 30 of a sample of 10,000 pseudo-galaxies (standard case). Each panel corresponds to a different parameter [from 
left to right: M*/L r , /sfh, V'Sj 12 + log (O/H), fy and fi\. In each case, average probability density functions in 50 narrow bins of true parameter value 
were obtained by coadding and then renormalizing the probability density functions retrieved for the 10,000 pseudo-galaxies (see Section [3. 1.1| for detail). 
Grey levels locate the 2.5th, 12th, 16th, 22nd, 30th, 40th, 60th, 70th, 78th, 84th, 88th and 97.5th percentiles of the average likelihood distribution in each bin. 
The solid line locates the associated median (best estimate) and the 2 dashed lines the 16th and 84th percentiles (68-percent confidence interval). For reference, 
3 dotted lines indicate the identity relation and deviations by a factor of 2 (±0.3 dex) between the retrieved and true parameter values. 



(68-percent confidence interval) and the outer edges of the lightest 
grey level the 2.5th and 97.5th percentiles (these would correspond 
to the ±2<r levels in the case of a Gaussian distribution). For refer- 
ence, 3 dotted lines indicate the identity relation and deviations by 
a factor of 2 (±0.3 dex) between the retrieved and true parameters. 

Fig. [6] shows that the r-band stellar mass-to-light ratio of 
nearby galaxies can be reasonably well constrained using 5-band 
ugriz photometry. The retrieved best estimate is in excellent agree- 
ment with the true parameter value over most of the explored range, 
and the associated uncertainty, defined as half the 16th-84th per- 
centile range (Section |2.4[ >, is only about 0.1 9 dex (this number 
would be larger if we allowed for IMF variations). At the extremi- 
ties of the range, the relative deficiency of galaxies with very small 
and very large M*/L r in the spectral library (Fig. [4^) slightly 
biases the retrieved likelihood distributions toward intermediate 
M„/L r values. Fig. [6] also shows that, in contrast to M*/L r , the 
fraction /sfh of stellar mass formed in the past 2.5 Gyr cannot be 
well retrieved from 5-band ugriz photometry. This is because the 
strong absorption features of intermediate-age stars have a negli- 
gible influence on broadband fluxes. The signatures of /sfh on 
broadband colours are otherwise difficult to disentangle from those 
of stellar metallicity and attenuation by dust. 

The constraints on specific star formation rate from 5-band 
ugriz photometry in Fig. [6] are meaningful only for actively star- 
forming galaxies, i.e. for ips > lGyr -1 , and the associated un- 
certainty is at best 0.30 dex. In such galaxies, the contamination 
of broadband fluxes by strong emission lines produces identifiable 
signatures ([O II] in the u band; H/3 and [O III] in the g band; Ha, 
[Nil] and [S II] in the r band). Even then, the similar colours of 
galaxies with both low ips and fv and those with both high tps 
and fy- give rise to long tails toward low in the retrieved like- 
lihood distributions. This degeneracy is illustrated by the overlap 
of galaxies with different tps in Fig [7^, which shows the g — i 
versus it — g colours of a subset of 50,000 models in the spectral 
library. Together with the fact that, even in galaxies with strong 
emission lines, broadband colours do not allow the distinction be- 
tween different line ratios, this also implies poor constraints on the 
gas-phase oxygen abundance and the dust parameters fy and fi in 



Fig. [6] The retrieved likelihood distributions of these parameters 
are largely driven by the prior distributions of Fig. [4] 

The results of Fig. [6] are consistent with those of Wuyt s et al.| 
(2009), who explored the recovery of physical parameters from 
broadband spectral energy distributions of model galaxies taken 
from hydrodynamical merger simulations. Our study differs from 
theirs in that we consider much wider ranges of star formation and 
chemical enrichment histories and dust properties to retrieve likeli- 
hood distributions of physical parameters. Also, |Wuyts et al.j ( 2009 > 
neglect the contribution by nebular emission to broadband fluxes 
and include 1 1 bands to sample the rest-frame spectral energy dis- 
tributions of galaxies over a wide range of wavelengths, between 
roughly 0.15 and 3/im. Finally, they analyze galaxy simulations 
in the redshift range 1.5 ^ z ^ 2.9, which span a much nar- 
rower range of stellar population ages than the pseudo-galaxies in 
our sample. For all these reasons, [Wuyts et al.|p009) find smaller 
typical uncertainties in the retrieved stellar mass (~ 0.10 dex) and 
star formation rate (~ 0.31 dex) than can be inferred from the con- 
straints on M»/L r and tps in Fig. [6] They also find much tighter 
constraints on the dust attenuation optical depth (~ 0.31). [Wuyts| 
et al. (20091 note that using simple (single-burst, constant and ex- 
ponentially declining) star formation histories to fit the results of 
more realistic hydrodynamical merger simulations can induce sig- 
nificant deviations between the retrieved maximum-likelihood and 
true parameters values. Our adoption of the sophisticated galaxy 
spectral library assembled in Section |2~3l should minimize this type 
of limitation when applying our approach to the interpretation of 
real observations (Section[4|. 

We now return to our investigation and examine the sensi- 
tivity of the results of Fig. [6] to the adopted signal-to-noise ratio, 
the photometric wavelength coverage and the inclusion of nebu- 
lar emission when interpreting broadband observations of galaxies. 
To quantify potential differences in the retrieved likelihood distri- 
butions relative to the standard case, we define the improvement 
factor as the ratio 

uncertainty in the standard case 
l a — —. - • (3.13) 

uncertainty 

This quantity can be computed for any true parameter value. It ex- 
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Figure 7. (a) g — i colour plotted against u — g colour for a subset of 
50,000 models from the galaxy spectral library assembled in Section [23] 
The models are colour-coded according to specific star formation rate, as 
indicated, (b) Same as (a), but without including the contribution by nebular 
emission to broadband fluxes. 



ceeds unity when the constraints are tighter (i.e. the uncertainty 
smaller) than in the standard case. We further define the gain in 
accuracy as the difference 

A = (accuracy in the standard case) — (accuracy) . (3.14) 

This quantity is positive when the retrieved best estimate is more 
accurate (i.e. the absolute difference between the retrieved best es- 
timate and true parameter value smaller) than in the standard case. 
In Fig. [5] we show the analog of Fig. [6] for 3 distinct alternatives 
to the standard case: adopting a signal-to-noise ratio S/N = 10 in- 
stead of 30 (top row); removing the constraints from the reddest 2 
photometric bands, i and z (middle row); and not including nebular 
emission in the model library used to analyze pseudo-observations 
(bottom row). In all cases, we show at the bottom of each panel the 



improvement factor l a and the gain in accuracy A as a function of 
true parameter value. 

Fig. [5] (top row) shows that reducing the signal-to-noise ratio 
from S/N = 30 to 10 makes the uncertainty in the retrieved M*/L T 
larger by about 15 percent and, at the extremities of the explored 
range, the accuracy between 10 and 25 percent worse. A more dra- 
matic difference relative to the standard case arises from the loss 
of the emission-line signal for actively star-forming galaxies. This 
causes a sharp drop in I a at high ips, implying poor constraints on 
the specific star formation rate over the entire explored range. Inter- 
estingly, Fig. [^(middle row) shows that the effect of removing the 
i- and z-band constraints at S/N = 30 is very similar to that of low- 
ering the signal-to-noise ratio of 5-band ugriz photometry. This is 
because the signatures of dust attenuation can be less well delin- 
eated from those of star formation from the ugr colours alone, even 
if the contamination of these broadband fluxes by strong emission 
lines can in principle be detected at high S/N (Fig.[6}. As expected, 
the other parameters, /sfh, 12 + log (O/H), fy and fi, can be even 
less well retrieved than in the standard case when the signal-to- 
noise ratio is lowered and when the i- and z-band constraints are 
removed. 

It is also of interest to quantify the importance of includ- 
ing nebular emission when interpreting broadband observations 
of galaxies. A comparison between the top and bottom panels of 
Fig. [7] shows that the contribution by [O II] emission to the it-band 
flux and that of H/3 and [O III] emission to the g-band flux can 
change the broadband u — g and g — i colours of galaxies with 
ips > lGyr -1 by several tenths of a magnitude. In Fig. [^(bot- 
tom row), we show the results of attempting to interpret the same 
pseudo-observations as in Fig.[6](which include the contribution by 
nebular emission to the ugriz fluxes) using a spectral library which 
neglects nebular emission. This spectral library was constructed in 
the same way as that assembled in Section [23] but ignoring the 
prescription for nebular emission in Section |2.2.2| The retrieved 
M*/L r in this case is very similar to that obtained in the stan- 
dard case, except for the slightly worse accuracy (A < 0) at very 
low M*/L r , consistent with the contamination of the r-band flux 
by Ha, [N II] and [S II] emission in actively star-forming galaxies. 
As anticipated from Fig. [7] models without nebular emission do 
not sample broadband colours well enough to allow the retrieval 
of meaningful constraints on for actively star-forming galax- 
ies. The other parameters, /sfh, 12 + log (O/H), fy and /i, are as 
poorly constrained as in the standard case. We conclude from Fig. [8] 
that high S/N, good wavelength coverage and accounting for nebu- 
lar emission are all important to tighten the constraints on specific 
star formation rate from broadband observations of galaxies, while 
the inclusion of nebular emission is less crucial to constrain stellar 
mass-to-light ratio. 

3.1.2 Galaxies at unknown redshift 

We have tested so far the retrievability of physical parameters from 
5-band ugriz photometry only for nearby galaxies. We now wish to 
explore the constraints achievable for galaxies at unknown redshift. 
|Ilbert et aI7| l |2006l > have shown that redshift can be best estimated 
from 5-band ugriz photometry out to z ~ 1.5. Photometric red- 
shift measurements of more distant galaxies require observations at 
longer wavelengths (e.g. |Ilbert et al.|20 09), which are beyond the 
scope of the present paper (see Pacifici et al., in preparation). We 
therefore restrict ourselves here to the retrievability of physical pa- 
rameters of galaxies from 5-band ugriz photometry at redshifts in 
the range < z < 1. 
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Figure 8. Same as Fig. [6] but for 3 distinct alternatives to the standard case: (top row) adopting a signal-to-noise ratio S/N = 10 instead of 30; (middle row) 
using constraints from only the ugr photometric bands instead of ugriz; and (bottom row) not including nebular emission in the model library used to analyze 
the sample of 10,000 pseudo-galaxies. In all cases, the improvement factor 7 CT and the gain in accuracy A (equations |3 . 1 3|j3. 1 4\ are shown as a function of 
true parameter value at the bottom of each panel to quantify differences in the retrieved likelihood distributions relative to the standard case of Fig. [6] 



We use an approach similar to that described in Section |2~3"] 
to generate a new library of 5 million spectral energy distributions 
of galaxies at random redshifts between and 1. For each of the 
500,000 star formation and chemical enrichment histories in the 
original library of Section |2~T| we first draw an observation red- 
shift z bs between and 1. Then, we proceed as outlined in Sec- 
tion |2.3| and draw 10 different realization of the evolutionary stage 
(i.e. galaxy age) at which the galaxy is looked at (uniformly in red- 
shift between z b s and z = 1.7, to better sample the parameter 



/sfh; see Section [2~T| and, for star-forming galaxies, of the cur- 
rent parameters ^>g, Z, log Uq, £d, fy and fi (using the distribu- 
tions in Table [TJ(. The resulting spectral library contains about 8 
percent early-type galaxies, i.e. slightly less than the 10 percent 
in the z — library assembled in Section |2.3| Fig. [9] shows the 
prior distributions of the physical parameters M*/L r , /sfh, ips, 
12 + log (O/H), fv and [i in this library. As in Fig. [4] in each panel, 
the shaded histogram shows the distribution including all galaxies, 
while the solid histogram shows the contribution by star-forming 




Figure 9. Prior distributions of the same physical parameters as in Fig. [4] 
for the 5 million galaxies at redshifts between and 1 in the spectral library 
assembled in Section [3.1.2| In each panel, the shaded histogram shows the 
distribution for all galaxies, while the solid histogram shows the contribu- 
tion by star-forming galaxies alone. Non-star-forming galaxies are off scale 
(at log ips = ~ 00 ) m panel (c) and do not contribute to the distributions of 
interstellar parameters in panels (d)-(f). 



galaxies alone. The prior distribution of the observer-frame abso- 
lute r-band stellar mass-to-light ratio in Fig. [9^ reaches larger val- 
ues than in Fig. [4^ because of the contribution by (dusty) high- 
redshift galaxies with large K corrections. The early evolutionary 
stage of high-redshift galaxies also implies a larger typical /sfh 
in Fig. [9J) than in Fig.[4j}. By construction, the distributions of the 
current physical parameters ips, 12 + log (O/H), fy and /1 are the 
same in Figs|4]and|9](from Table jj). 

By analogy with Section |3. 1 . 1 1 we generate pseudo- 
observations in the ugriz photometric bands of 10,000 galaxies 
in the new library. These pseudo-observations sample different 
wavelength ranges of the rest-frame spectral energy distributions 
of galaxies at different redshifts, from roughly 3000-10,000 A at 
-Zobs = to 1500-5000 A at z ohs = 1. We consider both high- 
quality (S/N = 30) and low-quality (S/N = 10) observations. Us- 
ing the rest of the 5 million models in the library, we can retrieve 
likelihood distributions of z bs for each of the 10,000 pseudo- 
galaxies, as well as likelihood distributions of A/„/L r , /sfh, V's, 
12 + log (O/H), T V and ^. 

In Fig. I lOfr, we plot the distribution of the difference between 
the retrieved best estimate (i.e. median) and true value of z bs, in 
units of 1 + .Zobs, for the 10,000 pseudo-galaxies in our sample. 
The results for S/N = 30 (shaded histogram) are in remarkable 
agreement with the a = 0.03 Gaussian uncertainty in this quantity 
quoted by |Ilbert et alj{2006) , who analyzed CFHTLS (S/N > 30) 
u*g'r'i'z' photometry of about 3000 galaxies with spectroscopic 
redshifts between 0.2 and 1.5 (dashed line). For S/N = 10, the 
retrieved best estimate of z Q bs in Fig.|10fi remains in good general 
agreement with the true value (solid histogram). To better illustrate 
the origin of the histograms in Figs |1 0ft, we show in Figs |1 0b and 
|10[ : the detail of the average retrieved probability density functions 
as a function of z Q bs for S/N = 30 and 10, respectively. 



Figure 10. (a) Distribution of the difference between the retrieved best es- 
timate (i.e. median) and true value of z bs, in units of 1 + z Q bs, for a sam- 
ple of 10,000 pseudo-galaxies at redshifts between and 1 observed in the 
ugriz photometric bands. The shaded and solid histograms show the distri- 
butions obtained for S/N = 30 and S/N = 10, respectively. For reference, 
a dashed line indicates the a = 0.03 Gaussian accuracy quoted by |Ilbert| 
|et al.H2006) , who analyzed CFHTLS (S/N > 30) u'g'r'i' z 1 photometry 
of about 3000 galaxies with spectroscopic redshifts between 0.2 and 1.5. (£>) 
Detail of the average retrieved probability density function plotted against 
z bs> f° r S/N = 30. The lines and shading have the same meaning as in 
Fig.|6] (c) Same as (b), but for S/N = 10. 



The retrieved probability density functions of the other physi- 
cal parameters of interest to us are difficult to interpret when aver- 
aged over galaxies at all redshifts between and 1, because of the 
different effects impacting the recovery of parameters in different 
redshifts ranges. For this reason, it is more instructive to explore the 
retrievability of M»/L r , /sfh, V's, 12 + log (O/H), tv and /1 for 
galaxies in different redshift ranges, for example, 0.2 < z bs < 0.4 
and 0.6 < jz D bs < 0.8. We redraw 10,000 spectral energy distribu- 
tions in each of these redshift ranges from the new galaxy spectral 
library, which we use to generate pseudo-observations in the ugriz 
photometric bands, for S/N = 30. As before, we assume that the 
redshift z b s of a galaxy is unknown a priori and use all models at 
redshifts between and 1 in the spectral library to retrieve likeli- 
hood distributions of physical parameters. According to Fig. |10b , 
Zobs can be retrieved with an uncertainty of roughly 0.05 in both 
redshift ranges at the adopted signal-to-noise ratio. 

Fig. [TT] shows the average probability density functions of 
physical parameters retrieved in this way for galaxies at redshifts 
0.2 < z obs < 0.4 (top row) and 0.6 < z D bs < 0.8 (bottom 
row). At low redshift, the constraints are similar to those obtained in 
Fig.|6]for galaxies at z = 0. The main difference is the extension of 
the M*/L r probability density functions to larger values, induced 
by the tail of high-redshift galaxies with large K corrections in the 
prior distribution (Fig.|9ji). This tends to worsen the typical uncer- 
tainty in Ah /L r (from 0. 19 to 0.22 dex) over the range sampled by 
Fig.|6] i.e. for log[(M»/L r )/(M Q /L r , )] < 0.4, while at larger 
M*/L r , the smaller number of models implies a significant bias 
in the retrieved likelihood distribution. Another noticeable differ- 
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Figure 11. Average probability density functions of the same 6 physical parameters as in Fig.[9]retrieved, using the Bayesian approach described in Section |2~4] 
from 5-band ugriz photometry with S/N = 30 of samples of 10,000 pseudo-galaxies at random redshifts: (top row) drawn in the redshift range 0.2 < z ^, s < 
0.4; and (bottom row) drawn in the redshift range 0.6 < z Q b s < 0.8. In each case, the redshift z Q ^ s of a galaxy is assumed not to be known a priori, and the 
probability density functions are computed using models at all redshifts between and 1 in the spectral library assembled in Section [3.1.2| In all panels, the 
lines and shading have the same meaning as in Fig. [6] 



ence caused by the absence of redshift information between Fig.|l 1| 
(top row) and Fig. [6] is the slightly worse uncertainty and bias in 
the retrieved specific star formation rate of actively star-forming 
galaxies. Fig. [TT] (bottom row) also shows that, at higher redshift, 
the loss of Ha information further worsens ips estimates in these 
galaxies. The constraints on M- M / L r are similar to those for low- 
redshift galaxies. In both redshift ranges in Fig.[TT| the parameters 
/sfh, 12 + log(0/H), Ty and fi are as poorly constrained as for 
z — galaxies in Fig. [6] 

We conclude that high-quality photometry in the ugriz bands 
can provide useful constraints on the mass-to-light ratio and spe- 
cific star formation rate of galaxies, especially at low redshift, but 
less so on the recent history of star formation and the enrichment 
in metals and dust of the interstellar medium. The lack of redshift 
information can significantly weaken these constraints. 

3.2 Spectroscopic constraints 

We now investigate how the constraints obtained above from 
multi-band photometry on the parameters M*/L r , /sfh, ips, 
12 + log (O/H), Ty and p, can be improved by appealing to dif- 
ferent types of spectroscopic observations. In Section [3,2.1| below, 
we first show that the equivalent widths of strong optical emission 
lines alone can provide useful clues about the specific star forma- 
tion rate and enrichment of the interstellar medium in star-forming 
galaxies. Then, in Sections |3.2.2| and |3.2.3| we explore in detail 
the constraints retrievable from low- and medium-resolution spec- 
troscopy, exploiting the ability with our approach to interpret simul- 



taneously the stellar and nebular emission from galaxies. Through- 
out this section, we assume that the galaxies have well-determined 
spectroscopic redshifts and perform all calculations at rest-frame 
wavelengths in the range between 3600 and 7400 A (Fig. [3}. 

3.2.1 Equivalent widths of strong optical emission lines 

The equivalent widths of optical emission lines reflect the ratio 
of young stars, which produce ionizing radiation, to older stars, 
which dominate the optical continuum radiation. These equiva- 
lent widths should therefore provide valuable information about 
the specific star formation rate and, if several lines are available, 
the gas-phase oxygen abundance and dust attenuation optical depth 
in star-forming galaxies. Since line equivalent widths do not pro- 
vide any information about the absolute stellar continuum emis- 
sion, they cannot be used to constrain the parameters M*/Z/ r nor 
/sfh - Similarly, line equivalent widths should not provide any use- 
ful constraint on the dust attenuation optical depth in the ambient 
ISM, /j,tv, which affects in similar ways emission lines emerg- 
ing from stellar birth clouds and the continuum radiation from old 
stars (equation |2.8} . We therefore focus here on the retrievability of 
the parameters tps, 12 + log (O/H) and the dust attenuation optical 
depth in stellar birth clouds, (1 — ju)tv, from observations of the 
equivalent widths of strong optical emission lines in star-forming 
galaxies. 

To proceed, we select a sample of 10,000 spectral energy 
distributions from the library assembled in Section |2.3| to com- 
pute pseudo-observations of the net emission equivalent widths 
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Figure 12. Average probability density functions of the specific star for- 
mation rate, ips, gas-phase oxygen abundance, 12 + log(0/H), and dust 
attenuation optical depth in stellar birth clouds, (1 — p)fv, retrieved, 
using the Bayesian approach described in Section [2~4] from the equiva- 
lent widths of optical emission lines in a sample of 10,000 star-forming 
pseudo-galaxies. The sample was extracted from the library of galaxy 
spectral energy distributions assembled in Section |2.3| assuming a me- 
dian signal-to-noise ratio per pixel S/N = 20 and requiring 3<r mea- 
surements of all line equivalent widths. (Top row) using the equivalent 
widths of [Oll]A3727; H/3; [O III] AA4959, 5007; [Nn]A6548 + Ho + 
[NII]A6584; and [S II]AA6716, 6731 at a spectral resolution of 50 A 
FWHM (R = 100 at A = 5000 A). (Bottom row) using the equivalent 
widths of [Oil] A3727; H/3; [Om]A4959; [Om]A5007; [Nn]A6548; Ha; 
[NII]A6584; [SlI]A6716 and [SlI]A6731 at a spectral resolution of 5 A 
FWHM (R = 1000 at A = 5000 A). In all panels, the lines and shading 
have the same meaning as in Fig. [6] 



of the following emission lines: [Oll]A3727; H/3; [OlIl]A4959; 
[Om]A5007; [Nn]A6548; Ha; [Nn]A6584; [Sn]A6716 and 
[Sll]A6731. We consider both the cases of low (50 A FWHM; 
R = 100 at A = 5000 A) and medium (5 A FWHM; R = 
1000 at A = 5000 A) spectral resolution and fix the median 
signal-to-noise ratio per pixel at S/N = 20. In the case of low 
spectral resolution, we treat as single emission features the line 
blends: [O III] AA4959, 5007; [N II] A6548 + Ha + [N II] A6584 and 
[S II]AA6716, 6731. We also require that, at a given spectral reso- 
lution, the net emission equivalent widths of all features be mea- 
sured at better than the 3 a level We apply the same requirement 
to the model spectra used to compute likelihood distributions of 
ips, 12 + log (O/H) and (1 — p)fv for each pseudo-galaxy. These 
requirements are filled by roughly 2.7 million out of the 5 mil- 
lion original models at low resolution, and 1.7 million models at 
medium resolution (where lines are not blended). 

Fig. [12] shows the average retrieved probability density func- 
tions of ips, 12 + log (O/H) and (1 - p)fv for the 10,000 star- 



5 To compute net emission equivalent widths in low-resolution (medium- 
resolution) spectra, we measure the continuum level as follows. We first de- 
spike the spectrum with a 9-pixel (11-pixel) median smoothing. We then 
mask 5-pixel-wide (9-pixel-wide) regions centred on the emission lines, 
through which we interpolate the spectrum. Finally, we smooth the spec- 
trum again and adopt the result as the continuum level. 



forming pseudo-galaxies in our sample, for low (top row) and 
medium (bottom row) spectral resolution. We find that, as expected, 
the equivalent widths of strong optical emission lines set tight con- 
straints on the specific star formation rate, ^>s . The uncertainty in 
the retrieved best estimate is only slightly worse at low (0.23 dex) 
than at medium (0.18 dex) spectral resolution. An advantage of 
medium spectral resolution is the possibility to constrain ips in 
more quiescent star-forming galaxies than at low spectral resolu- 
tion, for fixed S/N. According to Fig. 
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for S/N = 20, the limit 
under which the equivalent widths of all lines of interest cannot 
be measured at the 3a level is ips ~ 0.1 Gyr -1 at R — 100 and 
« 0.04 Gyr" 1 at R = 1000. 
Fig. [T2] (bottom row) further shows that, at medium spec- 
tral resolution, the equivalent widths of optical emission lines pro- 
vide tight constraints on the gas-phase oxygen abundance in star- 
forming galaxies. The uncertainty in the retrieved best estimate of 
12 + log (O/H) is only 0.02 at the high-metallicity end. It reaches 
0.17 at low 12 + log (O/H), because the rise in intrinsic emission- 
line intensity with decreasing metallicity (e.g., figure 4d of Chariot 
|& Lon ghetti 200 lj allows the inclusion of more dusty galaxies in 
the likelihood distributions. At low spectral resolution, the blending 
of the Ha and [N II] lines worsens considerably these constraints 
(Fig. [T2] top row). Moreover, the drop in emission-line intensity 
with increasing metallicity implies that 12 + log (O/H) can be re- 
trieved in more metal-rich galaxies at medium than at low spectral 
resolution, for fixed S/N. The constraints on the dust attenuation 
optical depth in stellar birth clouds in Fig.[T2]are also much tighter 
at medium than at low spectral resolution. At low spectral resolu- 
tion, the median retrieved uncertainty in (1 — p)fv is only about 
20 percent larger than at medium spectral resolution, but the best 
estimate is much more severely biased toward the prior expectation. 



3.2.2 Low-resolution spectroscopy 

We now turn to one of the most innovative parts of the present study 
enabled by the approach developed in Section [2] the retrieval of 
physical parameters from the simultaneous interpretation of strong 
absorption and emission features in low-resolution spectral energy 
distribution of galaxies over the entire rest-wavelength range from 
A = 3600 to 7400 A. As before, we extract 10,000 spectral energy 
distributions from the library assembled in Section [23] to compute 
pseudo-observations of galaxies at the resolution of 50 A FWHM 
(R — 100 at A = 5000 A) in this wavelength range. We adopt for 
the moment a median signal-to-noise ratio per pixel S/N = 20. We 
use the rest of the 5 million models in the library to retrieve likeli- 
hood distributions of physical parameters for each of these 10,000 
pseudo-galaxies. 

Fig.[T3](top row) shows the average retrieved probability den- 
sity functions of M,/L r , /sph, ips, 12 + log (O/H), fv and p for 
the 10,000 pseudo-galaxies in our sample. Also shown at the bot- 
tom of each panel are the factor of improvement in the retrieved 
uncertainty, I a , and the gain in accuracy, A, relative to the stan- 
dard case of Fig.[6|(equations |3.13f|3~T4l (. Fig.[T3](top row) reveals 
that a major improvement of low-resolution spectroscopy over rest- 
frame ugriz photometry is the ability to retrieve meaningful (i.e. 
unbiased) constraints not only on the stellar mass-to-light ratio and 
the specific star formation rate, but also on the recent star forma- 
tion history (/sfh) and the enrichment of metals and dust in the 
interstellar medium of galaxies [12 + log (O/H), fv and, to a lesser 
extent, p]. The gain in accuracy (A) reaches up to 30 percent of the 
explored dynamic range for 12 + log (O/H), 15 percent for fv and 
p and up to 20 percent for /sfh - This results from the ability to de- 
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Figure 13. Average probability density functions of the same 6 physical parameters as in Fig.[4]retrieved, using the Bayesian approach described in Section |2~4] 
from low-resolution optical spectra of a sample of 10,000 pseudo-galaxies. The spectra cover the wavelength range from A = 3600 to 7400 A at the resolution 
of 50 A FWHM (R = 100 at A = 5000 A) with median signal-to-noise ratio per pixel S/N = 20. (Top row) for a sample of 10,000 pseudo-galaxies extracted 
randomly from the spectral library assembled in Section |2~3| The improvement factor I a and the gain in accuracy A (equations |3 . 1 3|[3 . 1 4) are shown as a 
function of true parameter value at the bottom of each panel to quantify differences in the retrieved likelihood distributions relative to the standard case of 
Fig. [6] (Bottom row) for a sample of 10,000 star-forming pseudo-galaxies extracted from the same spectral library, with the requirement that the net Ha+[N II] 
emission equivalent width be greater than 5 A. In all panels, the lines and shading have the same meaning as in Fig. ml 



tect and interpret, even at low resolution, the strong Balmer absorp- 
tion features of intermediate-age stars and the emission-line signa- 
tures of interstellar gas. The median uncertainty in the retrieved 
fraction of current stellar mass formed in the last 2.5 Gyr amounts 
to about 0.23, that in 12 + log (O/H) about 0.28 and that in the total 
dust attenuation optical depth, fy, about 0.64. For M„/L r and tps, 
already constrained in a meaningful way by multiband photome- 
try, the improvement in uncertainty relative to the standard case 
amounts to a factor (J CT ) of between 1.5 and 3. 

A more in-depth investigation reveals that, in all panels in 
Fig.[T3](top row), the largest uncertainties pertain to galaxies with 
emission lines too weak to allow a distinction between the signa- 
tures of star formation history, metallicity and dust in the spectra. 
These are mainly galaxies with low specific star formation rate, 
high metallicity and strong attenuation of emission lines by dust 
in stellar birth clouds. In this context, it is interesting to explore 
how much the constraints tighten if we restrict the sample to low- 
resolution galaxy spectra exhibiting unambiguous emission lines. 
We appeal again to the library assembled in Section|23]to generate 
a sample of 10,000 pseudo-observations of low-resolution galaxy 
spectra, for S/N = 20, with the requirement that the net Ha+[N II] 
emission equivalent width be greater than 5 A (throughout this pa- 
per, we adopt the convention of positive equivalent widths for emis- 
sion lines). Fig. [T3] (bottom row) shows that, in this case, the con- 
straints on the interstellar parameters 12 + log (O/H) and fy im- 
prove significantly. The corresponding median uncertainties both 



drop by about 20 percent. The constraints on ips improve mainly 
at the low end of the explored range (the uncertainty dropping by 
about 15 percent at ips ;$ 0.1 Gyr -1 ), where the omission of galax- 
ies with weak emission lines is most significant. The improvement 
is only marginal for AI,/L r , /sfh and fi, which are constrained 
primarily by stellar continuum and absorption-line features rather 
than by nebular emission lines. 

The ability with our approach to exploit strong absorption and 
emission features in low-resolution galaxy spectra depends criti- 
cally on the quality of the observations. To illustrate this, we show 
in Fig. [14] (top row) the analog of Fig. |13|(t op row) when adopting 
a median signal-to-noise ratio per pixel S/N = 5 instead of 20. The 
reduced ability to interpret spectral features at S/N = 5 makes the 
median uncertainties in the different retrieved parameters increase 
by 30 percent (for M»/L r , /sfh, m) up to 90 percent (for ips) rel- 
ative to the case S/N = 20. It also introduces substantial biases in 
the retrieved likelihood distributions. A comparison of Fig.[T4|(top 
row) with Fig. [6] reveals that, in fact, low-resolution spectroscopy 
with S/N = 5 over the wavelength range from A = 3600 to 7400 A 
provides hardly better constraints on the different parameters than 
high-quality ugriz photometry (as confirmed by the fact that I a is 
close to unity and A close to zero in Fig.[T3] top row). The con- 
straints on ^>s and 12 + log (O/H) are somewhat tighter because 
of the influence of actively star-forming galaxies with strong emis- 
sion lines on the average retrieved probability density functions. 
The star formation history parameter is only slightly better con- 
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Figure 14. Same as Fig. [l3] but: (top row) adopting a median signal-to-noise ratio per pixel S/N = 5 instead of 20; (bottom row) not including nebular 
emission in the model library used to analyze the sample of 10,000 pseudo-galaxies. In both cases, the improvement factor l a and the gain in accuracy A 
(equations |3. 1 3|f3~. 14) are shown as a function of true parameter value at the bottom of each panel to quantify differences in the retrieved likelihood distributions 
relative to the standard case of Fig. [6] 



strained, as even strong Balmer absorption lines are difficult to in- 
terpret at low S/N. The constraints on fv are as poor as inferred 
from multiband photometry, reflecting the inability to extract useful 
information from the Ha/H/3 ratio in low-quality spectra. In Fig.|14| 
(top row), the constraints on M* / L r are actually weaker and more 
severely biased at the extremities of the explored range than those 
obtained from rest-frame ugriz photometry in Fig. [6] This suggests 
that high-quality photometry including the near-infrared iz bands 
can better constrain galaxy stellar masses than low-resolution opti- 
cal spectroscopy with S/N = 5. 

The inclusion of nebular emission also has a crucial influ- 
ence on the ability to constrain galaxy physical parameters from 
low-resolution optical spectroscopy. Fig. [14] (bottom row) shows 
the results obtained when attempting to interpret the same pseudo- 
observations as in Fig. [T3] (top row; which include nebular emis- 
sion) with the library of models not including nebular emission al- 
ready used in Fig. [S] (bottom row)[^]ln this case, the average re- 

B To avoid artefacts in the resulting probability density functions, we re- 
move prominent emission lines from the pseudo-galaxy spectra before com- 
puting the likelihood of models without nebular emission (equation |2.10) . 
To achieve this, we first measure the continuum level across the whole range 
from A = 3600 to 7400 A, as described in footnote]?] Then, we reset to the 



trieved likelihood distributions of all parameters are severely de- 
graded relative to Fig.[T3](top row). In fact, for most parameters, the 
constraints are worse than those derived in Fig.[6]from high-quality 
ugriz photometry when accounting for the effect of nebular emis- 
sion on photometric fluxes. Fig.[l4](bottom row) shows that, for ex- 
ample, using models which neglect the contamination of the r-band 
flux by Ha+[N II] and [S II] line emission biases M* /L r estimates 
upward for star-forming galaxies. Moreover, ignoring the contami- 
nation of stellar Balmer absorption features by nebular emission in 
low-resolution spectra causes systematic underestimates of /sfh in 
star-forming galaxies. For actively star-forming galaxies, the con- 
straints on ips in Fig. [14] (bottom row) are weaker than the rough 
constraints derived in Fig. [6] from the influence of nebular emis- 
sion on ugriz photometric observations. Not surprisingly, mod- 
els without nebular emission do not allow the extraction of useful 
constraints on the interstellar parameters 12 + log (O/H), fy and yt 
from low-resolution galaxy spectra. 

Hence, we find that important constraints can be obtained 
on the stellar and interstellar parameters of galaxies from low- 
resolution spectra at optical wavelengths. This however requires 

continuum level the flux of every pixel exceeding the continuum flux by 
more than 5 times the noise level. 
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Figure 15. Average probability density functions of the same 6 physical parameters as in Fig.[4]retrieved, using the Bayesian approach described in Section |2~4] 
from medium-resolution optical spectra of a sample of 10,000 pseudo-galaxies. The spectra cover the wavelength range from A = 3600 to 7400 A at the 
resolution of 5 A FWHM (R = 1000 at A = 5000 A) with median signal-to-noise ratio per pixel S/N = 20. {Top row) for a sample of 10,000 pseudo-galaxies 
extracted randomly from the spectral library assembled in Section |2~3| The improvement factor I a and the gain in accuracy A (equations |3. 1 3|(3". 14) are shown 
as a function of true parameter value at the bottom of each panel to quantify differences in the retrieved likelihood distributions relative to the standard case 
of Fig. [6] (Bottom row) for a sample of 10,000 star-forming pseudo-galaxies extracted from the same spectral library, with the requirement that the net Ha 
emission equivalent width be greater than 5 A. In all panels, the lines and shading have the same meaning as in Fig. ml 



high-quality observations as well as the inclusion of nebular emis- 
sion in the spectral analysis. 



3.2.3 Medium-resolution spectroscopy 

Medium-resolution spectroscopy should provide even more insight 
into the physical properties of galaxies than low-resolution spec- 
troscopy, because composite features such as the [Om] doublet, 
the Ha+[N II] blend and the [S II] doublet can be resolved into in- 
dividual emission lines. To investigate this, we extract 10,000 spec- 
tral energy distributions from the library assembled in Section [23] 
and compute pseudo-observations of galaxies at the resolution of 
5 A FWHM (R = 1000 at A = 5000 A) over the wavelength range 
from A = 3600 to 7400 A. We adopt a median signal-to-noise ratio 
per pixel S/N = 20. As before, we use the rest of the 5 million 
models in the library to retrieve likelihood distributions of physical 
parameters for each of these 10,000 pseudo-galaxies. 

In Fig. [T3] (top row), we show the average probability den- 
sity functions of M*/L r , /sfh, ips, 12 + log(0/H), fy and p re- 
trieved from the medium-resolution spectra of the 10,000 pseudo- 
galaxies in our sample. The factor l a and the gain A quantifying 
the improvements in the retrieved uncertainty and accuracy relative 
to the standard case of Fig.|6|(equations |3.13f[3~T4 i are also shown 
at the bottom of each panel. A comparison of Fig.|15|(top row) with 
Fig.[T3](top row) shows that an increase in spectral resolution by a 
factor of 10 produces significantly tighter constraints on all param- 



eters. The improvement is most significant for the specific star for- 
mation rate, the gas-phase oxygen abundance and the total dust at- 
tenuation optical depth, for which the median retrieved uncertainty 
drops by about 45 percent from low to medium spectral resolution. 
The gain is more modest, of the order of 25 percent, for M*/L r , 
/sfh, and p. The refined spectral information available at medium 
resolution also help reduce some biases in the retrieved best esti- 
mates of /sfh (for young galaxies), -0s (for quiescent star-forming 
galaxies) and /i (across most of the explored range) in Fig.|15|(top 
row) relative to Fig.[T3](top row). 

By analogy with Fig. [T3] we further show in Fig. [15] (bot- 
tom row) the average probability density functions obtained when 
selecting a sample of 10,000 pseudo-observations of medium- 
resolution galaxy spectra, for S/N = 20, with the requirement that 
the net Ha emission equivalent width be greater than 5 A. The re- 
trieved likelihood distributions of 12 + log (O/H) and fv are sig- 
nificantly narrower than when including also galaxies with weak 
emission lines in the sample, with median retrieved uncertainties of 
about 0.08 for 12 + log (O/H) and 0.30 for fy. As at low spectral 
resolution, restricting the sample to galaxies with prominent emis- 
sion lines reduces the uncertainties in mainly at the low end 
of the explored range. Changes in the retrieved likelihood distribu- 
tions of M,/L r , /sfh and p are marginal, because these parame- 
ters are constrained primarily by stellar continuum and absorption- 
line features. 

We have also investigated how the constraints obtained in 
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Fig.[l5]from the analysis of medium-resolution galaxy spectra de- 
pend on the assumed signal-to-noise ratio of the observations and 
the inclusion of nebular emission in the modeling (not shown). We 
find that adopting a median signal-to-noise ratio per pixel S/N = 5 
instead of 20 for the 10,000 pseudo-galaxy spectra makes the me- 
dian retrieved uncertainties in most parameters increase by 30 to 50 



percent (only 15 percent for M*/L r ) relative to Fig. 15 (top row) 



The biases in the retrieved best estimates are less severe than found 
in Fig. [14] (top row) at low spectral resolution. As expected from 
Fig. 1 14| (bottom row), accounting for nebular emission also has a 
critical influence on the retrievability of the specific star formation 
rate and the interstellar parameters from medium-resolution galaxy 
spectra. It is worth pointing out that, at medium (and high) spectral 
resolution, prominent nebular emission lines can be excised from 
the spectra and analyzed separately from the stellar continuum and 
absorption-line signatures if no tool is available to analyze the stel- 
lar and nebular emission simul taneously (e.g., |K auffmann ~et al.| 
2003a] IBrinchmann et al.|2004| |Gallazzi et al.|2005| >. This is not 
the case at low spectral resolution, where the analysis of the stel- 
lar and nebular emission must be combined to obtain meaningful 
constraints on galaxy physical parameters. 



4 APPLICATION TO A REAL SAMPLE 

The conclusions drawn in the previous section about the retriev- 
ability of galaxy physical parameters from different types of obser- 
vations at optical wavelengths rely on sophisticated simulations of 
galaxy spectral energy distributions. As emphasized in Section |2T4] 
such pseudo-observations are required to test the retrievability of 
physical parameters from observables, because the physical param- 
eters of real galaxies cannot be known a priori. In this section, 
we apply our approach to the interpretation of observed medium- 
resolution spectra of galaxies from the SDSS DR7. We compare 
the results obtained through the simultaneous interpretation of stel- 
lar and nebular emission (at the resolving power R ~ 1000 con- 
sidered in Section [3.2.3| l with results obtained for the same galaxies 
from previous separate analyses of nebular emission-line and stellar 
absorption-line features (at the native resolving power R ~ 2000 
of the SDSS observations). We also investigate the influence of the 
adopted prior distributions on the retrieved physical parameters of 
the galaxies. 



4.1 Physical parameters of SDSS galaxies 

The SDSS DR7 contains about 1 million galaxy spectra covering 
the wavelength range from 3800 to 9200 A (observer frame) at a 
spectral resolution of roug hly 3 A FWHM. Of these, we extract a 
subsample of 12,660 star-forming galaxies with high-quality spec- 
tra as follows. We first reject any duplicate spectrum of a same 
galaxy, all spectra of merger candidates (i.e. corresponding to spa- 
tial separations less than 1.5" and redshift separations less than 
200 km s -1 ) and all galaxies at redshift less than z — 0.03 (to 
ensure that the spectrum includes the [Oll]A3727 doublet) from 
the original sample. Among the remaining 829,570 galaxies, we 
select all emission-line galaxies with signal-to-noise ratio greater 
than 10 in [On]A3727, H/3, [Om]A5007, Ha, [Nn]A6584 and 
[S II]A6716. We reject AGNs by applying the conservative crite- 
rion of Kauffma nn et al.| ( |2003b| > in the standard Baldwi n et al.| 
{1981} line-diagnostic diagram (see Fig. [2^ above). The resulting 
28,075 star-forming galaxies have a mean redshift of 0.08 ± 0.05. 



We exclude all galaxies at redshift z > 0.15 to keep the sample ho- 
mogeneous and further select galaxies with median signal-to-noise 
ratio per pixel in the range 5 < S/N < 25. This leaves 23,213 
spectra of galaxies at a mean redshift of 0.07 ± 0.03, all of which 
have net Ha emission equivalent widths greater than 5 A. We iden- 
tify 12,660 galaxies in this sample for which good measurements of 
stellar mass, star formation rate, gas-phase oxygen abundance and 
dust attenuation optical depth are available from previous studies 
(see next paragraph). For the purpose of illustrating the usefulness 
of our approach to interpret galaxy spectra, we focus on the galaxy 
physical properties probed by the 3-arcsec-diameter SDSS fibers 
and do not consider any photometry-based aperture correction. 

Estimates of star formation rate, gas-phase oxygen abundance 
and dust attenuation optical depth are available for the 12,600 
SDSS galaxies in this sample from the work of Brinchmann et al. 
(20041 and Trem onti et al.| (12004). These estimates were obtained 
by fitting the fluxes of the [Oll]A3727, H/3, [OlIl]A5007, Ha, 
[Nll]A6584 and [S II]A6716 emission lines (measured to high pre- 
cision by adjusting a stellar population model to the spectral con- 
tinuum) with a large library of Chariot & Longhetti (2001) nebu- 
lar emission models. In addition, stellar- mass estimates are avail- 
able for all galaxies from the work of Gall azzi et al.| ( [2003) , who 
fitted the strengths of the 4000 A break and the H5a+H7a, 
[MgFe]' and [Mg 2 Fe] absorption-line features with a large library 
of |BruzuaT & Chariot (2003) stellar population synthesis mod- 
els. The combination of these stellar-mass estimates with the star- 
formation-rate estimates of Brinchmann et al. ( 2004 ) provides esti- 
mates of the specific star formation rate probed by the SDSS fiber. 

The above physical parameters were derived from spectral 
analyses at the original SDSS resolution of about 3 A FWHM. To 
exemplify the power of the approach developed in Section [2] and 
for the purpose of comparison with the results of Section |3.23| we 
degrade the SDSS spectra to a resolution of 5 A FWHM, adopt- 
ing as before a fixed pixel size of 2.5 A. Also, we focus on the 
rest-frame wavelength range from 3700 to 7400 A, which does not 
extend as much in the ultraviolet as that considered in Section [3,2,3l 
(Fig. [3^), but which does include the [Oll]A3727 emission dou- 
blet. By analogy with the analysis of pseudo-observations in Sec- 
tion [3723] we use the 5 million models in the spectral library as- 
sembled in Section |2~3"1 to retrieve likelihood distributions of phys- 
ical parameters from the simultaneous analysis of the stellar and 
nebular emission (equations |2. 10}f2. 12[ > for each SDSS galaxy in 
the sample. We note that, when computing likelihood distributions 
for a given galaxy, we exclude from the library all models older 
than the age of the Universe at the redshift of that galaxy. 

In Fig. [T6] we show how the probability density functions of 
stellar mass, gas-phase oxygen abundance, total V^-band attenua- 
tion optical depth of the dust and specific star formation rate re- 
trieved from our analysis compare with those obtained in previous 
separate studies of the stellar and nebular emission at higher spec- 
tral resolution. Fig. |16f i shows that, for example, our estimates of 
M* (obtained by multiplying M*/L r by the absolute galaxy lu- 
minosity in the observed r band) agree well with those derived by 
|Gallazzi ~a l. ( 2005 ) from the analysis of selected absorption-line 
indices. The slightly lower values derived here (~ 0.1 dex) are at- 
tributable to the updated stellar population synthesis prescription 
(Section |2.2.1} , We also find good agreement in Fig. 1 16b between 
our estimates of 12 + log(0/H) and those obtained by |Tremonti| 
|et al.| {2004) from the analysis of selected emission-lines lumi- 
nosities. The discrepancies at 12 + log(0/H) < 8.7 are consis- 
tent with the uncertainty expected at low metallicity from inter- 
pretations of medium-resolution galaxy spectra (Fig. [T5] bottom 
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Figure 16. Estimates of physical parameters retrieved from the spectra of 12,660 SDSS star-forming galaxies (degraded to a resolution of 5 A FWHM) using 
the Bayesian approach described in Section [2~4] plotted against estimates of the same parameters from different sources, as indicated, (a) Stellar mass, M*. (b) 
Gas-phase oxygen abundance, 12 + log (O/H). (c) Total V-band attenuation optical depth of the dust, fy. (d) Specific star formation rate, -0s. In each panel, 
the likelihood distributions from previous studies were reconstructed from the published 16th, 50th and 84th percentiles and combined with the likelihood 
distributions obtained in this work to generate 2D probability density functions. The contours depict the normalized co-added 2D probability density function 
including all 12,660 galaxies in the sample (on a linear scale 25 levels, the outer edge of the lightest grey level corresponding to the 96th percentile of the 2D 
distribution). The solid line is the identity relation. 



row). In Figs[T6p and |16[i, the likelihood distributions retrieved 
from our analysis favor systematically lower fy and ips than de- 
rived by |Brinchmann et al.| l |2004| from the analysi s of emission- 
line luminosities. This is because Brinchmann et al. ( 2004 1 adopt a 
shallower dust attenuation curve (f\ oc A _u y both in stellar birth 
clouds and in the ambient ISM) than that resulting typically from 
equations | |2.7} -| |2T9} . This makes the inferred dust optical depth 
and the correction for obscured star formation smaller at fixed ob- 
served reddening. 

Fig. [16] therefore shows that the likelihood distributions of 
M^, 12 + log (O/H), fy and ife retrieved using our approach agree 
well (modulo some understood offsets) with those obtained in pre- 
vious separate analyses of the stellar and nebular emission of SDSS 
galaxies at twice higher spectral resolution. Such an agreement is 
expected in part from the fact that these previous analyses rely 
on earlier versions of the same stellar population synthesis code 
and nebular emission models as adopted here (Bruzual & Char- 
|lot|2003[|Charlot & Longhetti|2001[ see Sections [2TT] and |2~2~2) . 
What is most remarkable is the demonstrated ability with our ap- 
proach to interpret simultaneously the stellar and nebular emission 
from galaxies without having to excise nebular emission lines from 
noisy spectra. As shown in Section [3] this ability extends to low- 
resolution galaxy spectra and even to multi-band photometry. 

It is of interest to see how the fraction of the current stellar 
mass formed during the last 2.5 Gyr, which has not been investi- 
gated previously, relates to the other physical parameters of SDSS 
galaxies retrieved using our approach. In Fig. [T7] we show the 
2D probability density functions of /sfh versus logM*, logics, 
12 + log (O/H), T V and /i for the 12,660 SDSS star-forming galax- 



ies in our sample. The Spearman rank correlation coefficients (eval- 
uated using the median estimates of the different parameters) are 
r s — —0.27 for the relation between /sfh and log M* and +0.59 
for that between /sfh and log t[>s ■ For this sample size, both corre- 
lations are significant at better than the 5a level. In contrast, none 
of the interstellar parameters 12 + log (O/H), fy and /i correlates 
significantly with /sfh (we note in passing that the required high- 
S/N detection of 6 emission lines tends to select galaxies with low 
fv and high fi). Taken at face value, the results of Fig. |17| would 
suggest that the least massive galaxies, which are the most actively 
star-forming ones today, have also formed the largest proportion of 
stars over the last 2.5 Gyr. Such a trend would be consistent with 
the general expectation that, as galaxies grow through star forma- 
tion, the exhaustion of the gas supply makes the rate of star for- 
mation drop gradually (see also, e.g., fig. 22 of Brinchman n et al.| 
|2004[ l. However, this conclusion must be taken with caution, as the 
present sample is not complete and the SDSS fibers probe only the 
inner parts of galaxies. 

4.2 Influence of the prior distributions of physical 
parameters 

The constraints derived above on the stellar and interstellar prop- 
erties of SDSS galaxies, along with the analyses of pseudo- 
observations carried out in Section [3] rely on the adoption of the 
prior distributions of galaxy physical parameters in Fig. [4] As out- 
lined in Section |2~4| these distributions are designed to be as flat 
as possible to make the spectral library appropriate for the retrieval 
of physical parameters from observed spectral energy distributions 
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Figure 17. Two-dimensional probability density functions of the fraction of the current stellar mass formed during the last 2.5 Gyr, /sfh , versus stellar mass, 
Mi,, specific star formation rate, -0s, gas-phase oxygen abundance, 12 + log(0/H), total V-band attenuation optical depth of the dust, fy, and fraction of 
fy arising from dust in the ambient ISM, fi. All parameters were retrieved from the spectra of the same 12,660 SDSS star-forming galaxies as in Fig. |16| 
(degraded to a resolution of 5 A FWHM) using the Bayesian approach described in Section [2~4| In each panel, the contours depict the normalized co-added 2D 
probability density function including all 12,660 galaxies in the sample (on a linear scale of 25 levels, the outer edge of the lightest grey level corresponding 
to the 96th percentile of the 2D distribution). All physical quantities pertain to the regions of galaxies probed by the 3-arcsec-diameter SDSS fibers. 



of galaxies in wide ranges of physical properties. To illustrate the 
importance of this choice, we examine here the results obtained 
when adopting different prior distributions reflecting more specific 
properties of the galaxy population. As an example, we consider 
the prior distributions of galaxy physical parameters derived from 
the original semi-analytic recipes of |De Lucia & BlaizotH2007) . 

In Section [2T[ we used the semi-analytic recipes of |De Lu^| 
cia & Blaizot (2007 ) to generate the star formation and chemical 
enrichment histories of 500,000 galaxies extracted from the Mil- 
lennium cosmological simulation of Springel et al. ( 2005 ). The pa- 
rameters M*/L r , /sfh, ips and the interstellar metallicity Z are 
computed by default in this procedure. |De Lucia & Blaizot l j2007) 
do not provide any recipe to compute the gas-phase oxygen abun- 
dance nor the ionization parameter of the gas heated by young stars. 
Also, while attenuation by dust is included using the 2-component 
model of Chariot & Fall ( 2000 , adopting a fixed attenuation curve 
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both in stellar birth clouds and in the ambient ISM), 



we find that the recipe used by |De Lucia & B laizot ( 2007 ) to relate 
the V-band dust optical depth to the galaxy gas mass, metallicity 
and orientation produces typically much lower fy than observed 
in SDSS galaxies. To compute prior distributions of interstellar pa- 
rameters associated to the original semi-analytic recipes of |De Lu^| 
|cia & B laizot (2007 ), we therefore proceed as follows. For each of 
the 500,000 galaxies in the library, we draw 10 different realiza- 
tions of log Uo, £d and fi from the distributions in Table[TJ We also 
draw 10 different realizations of fy from a Gaussian distribution 
centred on 1.2 with a standard deviation of 0.6. This roughly repre- 
sents the global properties of SDSS galaxies for a fixed attenuation 
curve f\ oc A~ 0,7 in stellar birth clouds and in the ambient ISM 
(Brinchmann et al. 2004 ). The new spectral library contains 5 mil- 
lion models, which we can use to interpret observations of galaxy 
spectral energy distributions in the same way as the library assem- 
bled in Sectionl23l 



Fig. 
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shows the prior distributions of M,/L r , /sfh, tps, 
12 + log (O/H), fy and fi in this library at the mean redshift z = 
0.07 of the SDSS sample of Section |4~T) In each panel, the shaded 
histogram shows the distribution including all galaxies, while the 
thin solid histogram shows the contribution by star-forming galax- 
ies alone. As note d in Section|2.1| 35 p ercent of the galaxies in the 
original library of |De Lucia & Blaizot ( 2007] do not form stars at 
low redshift, while the distributions of ips and /sfh for the re- 
maining galaxies are narrow and characteristic of quiescent star 
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Figure 18. Prior distributions of the same physical parameters as in Fig. H] 
for the 5 million galaxies in the spectral library generated in Section ft. 2| 
These distributions are those arising, at the mean redshift z = 0.07 of the 
SDSS sample of Section [4~7| from the original semi-analytic post-treatment 
of the Millennium cosmological simulation of Springel et al. 1 2005} by |De| 
|Lucia & Blaizot ( 2007 see text for detail). In each panel, the shaded and 
thin solid histograms have the same meaning as in Fig. pj In panels (a), 
(c), (d) and (e), the thick solid histograms show the distributions of the 
best estimates of Af„/L r , -0s, 12 + log(0/H) and fy from Galla zzi et ah] 
(2005) , |Bnnchmann et aT1j2004) and |Tremonti et al.H2004> for the same 
sample of 12,660 SDSS star-forming galaxies as in Fig.|16| 



formation (Figs 1 18b and |18fc). As a result, most galaxies have rel- 
atively large M*/L r (Fig. |18fr). In Fig. 181, the narrow distribu- 
tion of 12 + log (O/H) arises from the correlation of the interstel- 
lar metallicity Z with ^>g for star-forming galaxies in this library. 
Also shown as thick solid histograms in Fig. [T8] are the distribu- 
tions of the best estimates of M*/ L r , ips, 12 + log (O/H) and fy 
from Gallazzi et al. ( 2005 ), Brinchmann et al. ( 2004 1 and Tremonti 
[eTallpOOl) for the 12,660 SDSS star-forming galaxies of Sec- 
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tion |4. 1 1 The required detection of 6 emission lines with S/N > 10 
in these galaxies implies that they are typically more actively star- 
forming (and hence have lower M*/L r ) and less dusty than the 
bulk of SDSS galaxies the Millennium cosmological simulation is 
expected to represent. This is the origin of the differences between 
the shaded and thick solid histograms in Fig. 1 1 8 1 

In Fig.[T9] we show the analog of Fig.[T6]obtained when using 
the prior distributions in Fig [18] instead of those in Fig. [4] to re- 
trieve the probability density functions of M*, 12 + log (O/H), fv 
and ips f°r the 12,660 SDSS star-forming galaxies in our sample. 
The upward bias in the prior distribution of M* / L r relative to the 
observed sample (Fig.|18fr) causes the estimates of M* in Fig.|19|t 
to be systematically larger than those derived by Gallaz zi et al.| 
(20051. In Fig. 1 19(1, the retrieved is about 60 percent smaller 
than that estimated by Brinchm ann et al.] j2004) , as in Fig. |16( 1. 
The offset in this case results from the downward bias in the prior 
distribution of ips (Fig.|18(:) rather than from differences in the dust 
attenuation curve, which is assumed to be the same by |De Lucia &| 
|Blaizot|{2"007] > and |Brinchmann et al.| ( |2004[>. M eanwhile, the up- 
ward bias in the prior distribution of fv (Fig. T8fc ) causes estimates 
of this para meter in Fig. 1 19}: t o be systematically larger than those 
derived by Brinchmann et al. ( 2004). In Fig. \19\ ), the estimates of 
12 + log (O/H) retrieved using the prior distribution of Fig. |18p 
agree well with those derived by Tremonti et al. 1 2004| >. 

The results of Figs [T6] and 19 illustrate the importance of 
the choice of appropriate prior distributions when applying the 
Bayesian approach described in Section |2T4] to retrieve physical pa- 
rameters from observed spectral energy distributions of galaxies. 
While the prior distributions of Fig. [18] are expected to be repre- 
sentative of the SDSS sample as a whole, adopting these distribu- 
tions to analyze subsamples of galaxies with properties different 
from the bulk can induce unwanted offsets in the retrieved physi- 
cal parameters. In cases where some specific physical property of 
the observed galaxies can be known in advance (e.g., actively star- 
forming, early-type, metal-poor), the use of appropriate restrictive 
prior distributions can help reduce the uncertainties in the retrieved 
parameters. When no advance information is available, it is prefer- 
able to appeal to flat (maximum-ignorance) prior distributions en- 
compassing wide ranges of galaxy physical properties. 



5 SUMMARY AND CONCLUSION 

We have developed a new approach to constrain galaxy physical 
parameters from the combined interpretation of stellar and nebular 
emission in wide ranges of photometric and spectroscopic obser- 
vations. Our approach relies on the simulation of a comprehensive 
library of 5 million galaxy spectral energy distributions using a set 
of state-of-the-art models of star formation and chemical enrich- 
ment histories, stellar population synthesis, nebular emission and 
attenuation by dust. This library can be used to retrieve probability 
density functions of physical parameters from the Bayesian anal- 
ysis of any type of multi-wavelength galaxy observation. We fo- 
cus in this paper on the retrievability of the observer-frame abso- 
lute r-band stellar mass-to-light ratio (Af„/L r ), the fraction of a 
current galaxy stellar mass formed during the last 2.5 Gyr (/sph), 
the specific star formation rate (ips), the gas-phase oxygen abun- 
dance [12 + log (O/H)], the total effective y-band absorption opti- 
cal depth of the dust (fy) and the fraction of this arising from dust 
in the ambient ISM (pt) from different types of observations at opti- 
cal wavelengths: 5-band ugriz photometry; equivalent-width mea- 
surements of strong emission lines; low-resolution spectroscopy 



(50 A FWHM over the wavelength range A=3600-7400 A); and 
medium-resolution spectroscopy (5 A FWHM over the same wave- 
length range). 

Since we cannot know the true properties of any sample of 
observed galaxies, assessing the retrievability of galaxy physical 
parameters from these different types of observations requires the 
simulation of pseudo-observations. We therefore simulate pseudo- 
observations by convolving the spectral energy distributions of 
models with known parameters with appropriate instrument re- 
sponses and then applying artificial noise to mimic true observa- 
tions. For each type of photometric or spectroscopic observation, 
we simulate 10,000 pseudo-observations in this way and use the 
other ~ 5 million models in the library to retrieve likelihood distri- 
butions of M*/L r , /sfh, ips, 12 + log (O/H), Ty and /j, for each 
pseudo-galaxy (using equations |2.10f|2.lT| l. Table [2] summarizes 
the main conclusions of our study. For each parameter and each 
type of observation, we list the median retrieved uncertainty (de- 
fined as half the 16th-84th percentile range of the retrieved likeli- 
hood distribution) and accuracy (defined as the absolute difference 
between the retrieved best estimate and true parameter value) de- 
termined from the analysis of the 10,000 pseudo-observations in 
Section|3] 

Several interesting conclusions can be drawn from Table [2] 
We find that, for example, rest-frame ugriz photometry can pro- 
vide roughly 50-percent uncertainty on M*/L r measurements, 
i.e. almost as tight as the 25- to 35-percent uncertainty provided 
by medium- and low-resolution optical spectroscopy (photomet- 
ric constraints worsen somewhat if the galaxy redshift is not 
known; see Fig. [TTJ. Meanwhile, optical spectroscopy is required 
to constrain /sfh through the strong absorption-line signatures 
of intermediate-age stars. Table [2] also shows that observations of 
the net emission equivalent widths of prominent optical lines are 
sufficient to constrain ips to within 50-70 percent (depending on 
the spectral resolution) and 12 + log (O/H) to within 0.05-0.23 in 
star-forming galaxies. Optical spectroscopy can help reduce these 
uncertainties and the potential small offsets in the retrieved best 
estimates of ips and 12 + log (O/H). In addition, spectroscopy is 
required to constrain both parameters in quiescent star-forming 
galaxies with weak emission lines (ips < 0.07 Gyr^ 1 ; see Fig. 12 1. 
Medium-resolution spectroscopy provides the best constraints on 
dust parameters in Table [2] with uncertainties of typically 0.35 in 
fv and 0.10 in pi (these include uncertainties in the optical proper- 
ties and spatial distribution of the dust as well as in the orientation 
of the galaxy; see Section [2,2.3[ >. At lower spectral resolution, the 
less distinct signatures of emission lines make the uncertainty in 
fv rise by about 80 percent, while the equivalent widths of optical 
emission lines at any spectral resolution constrain only the part of 
the attenuation arising from stellar birth clouds. 

We stress that the constraints in Table [2] rely critically on the 
ability with our approach to interpret simultaneously the stellar 
and nebular emission from galaxies. In the case of low-resolution 
spectroscopy, no reliable constraint can be obtained on any of the 
physical parameters in Table [2] (except for the mass-to-light ratio 
of early-type galaxies) when nebular emission is not included in 
the analysis (Fig.|14[l. The same is true at medium spectral resolu- 
tion, although if the resolution is high enough, prominent nebular 
emission lines can be excised from the spectra and analyzed sep- 
arately from the stellar continuum and absorption-line signatures. 
We find that the likelihood distributions of K'L, 12 + log (O/H), f v 
and ips retrieved from the combined analysis of stellar and nebular 
emission in the spectra of 12,660 SDSS star-forming galaxies (de- 
graded to a resolution of 5 A FWHM) using our approach agree 



22 C. Pacifici et al. 




9 10 11 

log(M./M ) [Gallazzi et al. 2005] 




1 2 
f v [Brinchmann et al. 2004] 



x 
o 



o 

O 




8.5 9.0 
12 + log(0/H) [Tremonti et al. 2004] 




-1 
^gC^s/Gy 1 ""') [Brinchmann et al. 2004] 



Figure 19. Same as Fig. |16| but using the prior distributions in Fig |18| instead of those in Fig. [4] to retrieve the probability density functions of M* 
12 + log (O/H), Ty and ipg for the 12,660 SDSS star-forming galaxies in the sample. 



Table 2. Summary of the constraints on galaxy physical parameters retrieved from different types of optical observations using the method developed in 
Section[2] For each parameter and each type of observation, the quoted uncertainty (defined as half the 16th-84th percentile range of the retrieved likelihood 
distribution) and accuracy (defined as the absolute difference between the retrieved best estimate and true parameter value) are median quantities determined 
from the analysis of 10,000 pseudo-observations in Section[3] 



Parameter 


Constraint 


Photometry™ 


Net emission-line EWs' 




Spectroscopy 6 








rest-frame ugriz 


R = 100 


R = 1000 


R = 


100 


R = 


1000 






All c 


SF d 


SF e 


All c 


SF^ 


All c 


SF« 


log[(Af./L r )/(M /L r , )] 


Uncertainty 


0.19 






0.13 


0.14 


0.10 


0.11 




Accuracy 


0.04 






0.01 


0.01 


0.00 


0.00 


/SFH 


Uncertainty 


0.36 






0.23 


0.23 


0.17 


0.19 




Accuracy 


0.17 






0.05 


0.05 


0.02 


0.03 


logOs/Gyr- 1 ) 


Uncertainty 


0.67 


0.23 


0.18 


0.24 


0.22 


0.12 


0.12 




Accuracy 


0.15 


0.12 


0.05 


0.01 


0.00 


0.00 


0.00 


12 + log (O/H) 


Uncertainty 


0.50 


0.23 


0.05 


0.28 


0.21 


0.13 


0.08 




Accuracy 


0.33 


0.11 


0.03 


0.04 


0.03 


0.00 


0.00 


TV 


Uncertainty 


0.91 


0A8 h 


0Al h 


0.64 


0.52 


0.35 


0.30 




Accuracy 


0.21 


0.57 h 


0.14 h 


0.05 


0.00 


0.00 


0.00 


/' 


Uncertainty 


0.20 






0.16 


0.15 


0.10 


0.10 




Accuracy 


0.10 






0.04 


0.02 


0.00 


0.00 



a For fixed signal-to-noise ratio S/N = 30 in all bands. 

6 For median signal-to-noise ratio per pixel S/N = 20 across the wavelength range A=3600-7400 A. 
c Averaged over all galaxy types. 

d For greater-than-3o- detections of [O n]A3727; H/3; [O m]AA4959, 5007; [Nn]A6548 + Ho + [N ll]A6584; and [S n]AA6716, 6731. 

e For greater-than-3cr detections of [O II]A3727; H/3; [OlII]A4959; [OlII]A5007; [N II]A6548; Ho; [NlI]A6584; [S II]A6716 and [SlI]A6731. 

f For galaxies with net Ho+[N II] emission equivalent width greater than 5 A. 

9 For galaxies with net Ho emission equivalent width greater than 5 A. 

h Constraint on the birth-cloud component of the attenuation only (i.e. t v c and not fy). 
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well (modulo some recent model improvements) with those ob- 
tained in previous separate analyses of the stellar and nebular emis- 
sion of these galaxies at twice higher sp ectral resolution jBrinch-| 
|mann et al.|2004| |Tremonti et al.|[2004l |Gallazzi et al.|2005| see 
Fig. 1 16}. Accounting for nebular emission is less crucial when de- 
riving constraints on M*/L r from ugriz photometry (Fig. [8] bot- 
tom row). 

Another important factor we can investigate in a straightfor- 
ward way using our approach is the influence of signal-to-noise 
ratio on the retrieval of galaxy physical parameters from different 
types of observations. For multiband photometry, we find that low- 
ering the signal-to-noise ratio from S/N = 30 to 10 makes the 
uncertainty in the retrieved M t /L r in Table [5] larger by about 15 
percent (Fig. [8p. For spectroscopy, adopting a median signal-to- 
noise ratio per pixel S/N = 5 instead of 20 makes the median un- 
certainties in the different retrieved parameters increase by about 
30 to 90 percent and introduces substantial biases in the retrieved 
likelihood distributions (Fig. 1 14}. The ability to make such assess- 
ments is valuable as it can help decide between different strategies 
when desi gning an observational campaign. For example, as noted 
in Section 



3.2.2 



low-resolution spectroscopy with S/N = 5 over 
the wavelength range from A = 3600 to 7400 A provides hardly 
better constraints on the different parameters in Table [2] than high- 
quality ugriz photometry (Figs [o] and |14|l. In contrast, the con- 
straints from low-resolution spectroscopy with S/N = 20 over the 
same wavelength range are generally far superior to those obtained 
from multiband photometry (Table [2j. 

It is important to keep in mind that the results reported in Ta- 
ble [2] about the retrievability of galaxy physical parameters from 
different types of observations correspond to the optimistic case 
in which the models we rely on are good approximations of true 
galaxies (Section \2.4\ . We have ensured this in the best possible 
way by appealing to state-of-the-art models including the most re- 
cent progress in galaxy spectral modeling to generate the library 
of spectral energy distributions on which our approach is based. 
With this in mind, the tool we have developed should be useful not 
only to interpret existing galaxy datasets, but also to design future 
observations. In principle, such observations can be of any photo- 
metric or spectroscopic type (or a combination thereof ) across the 
wavelength range covered by spectral evolution models. We have 
focused in this paper on a few examples of photometric and spec- 
troscopic observations at rest-frame optical wavelengths. In future 
work, we will present applications of our approach to the interpreta- 
tion of the ultraviolet and infrared emission from galaxies at various 
redshifts. The tool we have developed is intended to be made avail- 
able to the general astronomical community. In the meantime, we 
encourage colleagues interested in the application of our approach 
to the interpretation of specific galaxy observations to contact us. 
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